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ABSTRACT 

Seven  short-term  forecasting  models,  two  using  least 
squares  estimation  methods  and  five  employing  variations  of 
the  exponentially  weighted  moving  average  method,  are  com- 
pared in  their  relative  ability  to  produce  minimum  error 
variance  forecasts  for  seven  simulated  time  series.   Each 
series  was  generated  to  enable  one  of  the  forecast  models 
to  be  the  least  squared  error  predictor.   A  comparison 
methodology  is  developed  which  facilitates  forecast  model 
selection  based  on  single  or  group  series  forecast  per- 
formance through  the  measurement  of  model  specification 
errors.   A  computer  program  is  presented  which  may  be 
modified  to  accepi  ic^I  LxiuS  scries  and  which  perr.:i*s  the 
forecast  models  to  be  ranked  in  order  of  their  relative 
specification  error. 
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TABLE  OF  SYMBOLS  AND  ABBREVIATIONS 

Following  are  the  meanings  of  the  notation  most  fre- 
quently used  throughout  this  thesis. 

Y  =   the  estimate  or  forecast  of  the  observed  value  (Y  ) 

of  a  random  process  {Y  }  at  time  t. 

Y  =   the  estimate  or  forecast  of  the  level  of  mean  value 

(Y  )  at  time  t  of  the  process  (Y.}  made  at  time  t. 
b    =   the  estimate  at  time  t-1  of  the  process  slope  (b  ) 

at  time  t  in  a  linear  model. 
e.   =   a  random  shock  or  superimposed  error  experienced  by 

the  random  process  at  time  t  which  contributes  to  the 

error  in  forecasting  the  observed  value. 
F    =   the  one  step  ahead  forecast  at  time  t-1  of  the  as 

yet  unobserved  series  value,  Y  . 

e.   e   the  one  step  ahead  forecast  error  (e.    =  Y.  -F.")  which 
t  ^  v  t     t   t 

is  realized  when  the  observation  Y   is  taken. 

a    =   the  smoothing  constant  in  a  EWMA  model.   Also,  a  is 
the  portion  of  a  random  shock  which  is  considered  to 
be  a  permanent  contribution  to  process  level,  a  =  1-3. 

3    =   the  discounting  factor  in  an  EWMA  model;  the  rate  at 

which  the  weight  given  to  past  observations  diminishes. 
3  =  1-a. 

a'   =   the  smoothing  constant  associated  with  the  slope  es- 
timator of  a  forecast  model. 

3'   =   the  discounting  factor  associated  with  the  forecast 

model  slope  estimator.   3'  is  not  necessarily  equal  to 
1-a*  . 


=  the  estimate  of  3?,  the  slope  of  the  linear  model  in 

least  squares  methodology. 
=  the  autocorrelation  coefficient  of  the  series  (Y..} 

with  lag  of  one  period. 
=  the  measure  of  model  specification  error. 


I .   INTRODUCTION 

When  a  random  event  has  a  significant  impact  on 
society,  considerable  effort  will  be  made  to  predict  its 
occurrence.   The  ability  to  predict  or  forecast  such  an 
event  is  a  by-product  of  a  quantitative  understanding  of 
the  situation,  a  physical  model.   Prediction  based  upon 
a  behavioral  model  is  the  ideal,  but  forecasts  can  also 
be  based  upon  recognition  of  regularity  as  well  as  upon 
explanation  of  that  regularity. 

There  are  many  situations  in  both  industrial  and 
governmental  operations  which  require  forecasts  for  hun- 
dreds or  thousands  of  routinely  recurring  events.   Often 
Liicoc  evoiits,  sucii  a.3   equipment  failures  which  generate 
demands  for  repair  parts,  are  not  influenced  by  adver- 
tising or  other  factors.   In  the  absence  of  additional 
information  a  projection  into  the  future  must  be  made 
based  entirely  upon  past  observations  of  these  events. 
It  is  frequently  true  that  none  of  the  events,  individ- 
ually, are  of  sufficient  importance  to  warrant  the  study 
and  attention  required  to  develop  behavioral  models. 
For  these  items  of  low  cost  and  nonsensitive  nature  a 
routine  forecasting  system  is  desired  which  employs  the 
"management  by  exception"  principle.   In  evaluating  the 
forecast  accuracy  of  such  systems,  a  cost-effectiveness 
approach  must  be  taken.   A  forecast  model  which  performs 
well  in  one  application  may  be  totally  unsuited  in  another 


where  the  forecast  accuracy  required  is  more  stringent. 
Since  any  forecasting  system  selected  for  use  must  be  sat- 
isfactorily adaptable  to  a  wide  range  of  demand  patterns 
it  is  of  interest  to  examine  a  few  of  the  short-term  fore- 
casting models  which  are  currently  in  vogue. 

This  thesis  investigates  the  types  of  time  series  for 
which  various  forecast  models  are  appropriate  and  the  de- 
gree to  which  their  forecast  performance  is  degraded  by 
changes  in  the  forecast  series  generation  model.   The  re- 
sults of  this  investigation  should  suggest  which  model  or 
models  tend  to  be  most  adaptive  in  the  sense  that  satis- 
factory forecasts  are  consistently  made  for  the  particular 
series  forecast  in  the  study.   It  is  realized  that  fore- 


somewhat  subjective.   Also,  the  results  obtained  will  nec- 
essarily be  highly  dependent  on  the  series  upon  which 
forecasts  are  based,  and  the  series  used  in  this  study  may 
not  be  representative  of  the  many  demand  or  failure  pat- 
terns which  must  be  forecast  in  practice.   One  must  also 
remember  that  the  optimality  (minimum  variance  unbiased- 
ness)  of  even  the  least  versatile  forecast  model  may  be 
demonstrated.   As  Bossons  [Ref.  15]  has  concluded: 

(a)   if  it  can  be  shown  that  the  model  corresponds 
to  a  linear  transformation  of  the  stochastic  process  as- 
sumed to  generate  the  time  series  to  be  forecast  by  the 
model ,  and 
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(b)   if  efficient  estimates  can  be  derived  for  the 
parameters  of  the  stochastic  model  and  thus  also  of  the  rule 
for  "adapting"  the  forecasts,  then  the  optimality  of  any 
rule  can  be  demonstrated. 

The  decision  to  trade  this  possibly  restricted  optimality 
for  versatility  can  only  be  made  in  the  environment  where 
the  model  is  to  be  ultimately  used.   It  should  also  be 
noted  that  the  measure  of  optimality  associated  with  the 
EWMA  models  differs  from  that  used  in  the  least -squares 
models  and  for  this  reason  a  brief  discussion  of  the  dif- 
ference is  needed.   The  EWMA  models  use  the  method  of 
discounted  least  squares  (D.L.S.).   An  excellent  discussion 
by  Gilchrist  of  this  and  other  discounting  methods  is  con- 
tained in  Reference  15.   His  modification  of  i  ne   method  of 
moments,  least-squares  and  maximum  likelihood  to  incorporate 
discounting  may  offer  a  fertile  area  for  future  investiga- 
tion in  which  the  modified  methods  are  compared  to  the 
traditional  procedures.   He  argues  that  the  discounted 
estimators  have  more  robust  properties  than  the  standard 
methods . 

The  method  of  discounted  least  squares  is  normally  used 
with  models  of  the  general  form 

Yt  =  f(t;Xt_1,...,Xo;6)  +  ^ ,       t- Q,l,... 

where  the  fth  observation  is  expressed  as  a  function  of  all 
previous  observations,  a  parameter  6,  and  time  t.   The  es- 
timates of  0  are  chosen  to  minimize  the  discounted  sum  of 
squared  errors 
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W.  el    . 
t  t-i 


i  =  0 
where  W   is  usually  of  the  form  £  ,0<3<1.   In  using  this 
form  of  W. ,  a  weighted  average  results  which  gives  maximum 
weight  to  the  most  recent  data.   The  estimators  are  easily 
updated  by  simple  recursive  relations  as  each  new  observa- 
tion is  taken.  The  comparisons  made  in  this  investigation 
were  based  only  on  the  mean  squared  error  of  forecast,  how- 
ever, and  no  discounting  was  involved.   It  should  be  made 
clear  that  forecast  performance  has  been  measured  objec- 
tively without  regard  to  the  type  of  optimality  which  each 
model  seeks  to  obtain.   There  is  no  attempt  in  this  thesis 
to  extend  conclusions  beyond  such  inherent  characteristics. 
However,  it  is  hoped  that  subsequently,  when  operational 
time  series  data  must  be  forecast  using  a  model,  the  com- 
parison methodology  expounded  here  will  aid  in  the  selection 
of  the  "best"  model  for  that  data. 
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I I .   SHORT-TERM  FORECASTING  MODELS 

A.    LEAST- SQUARES  ESTIMATION  MODELS 

This  classical  prediction  method  has  evolved,  using  re- 
gression analysis,  by  considering  a  functional  relationship 
between  the  observed  values  of  the  random  process  and  an 
independent  or  control  variable.   The  General  Linear  Hy- 
pothesis is  widely  known  and  therefore  will  not  receive 
more  than  a  brief  statement  in  this  section.   An  excellent 
discussion  of  the  conditions  under  which  this  approach  is 
most  appropriate  may  be  found  in  Reference  1. 

Zehna  [Ref.  1]  and  Coventry  [Ref.2]  have  investigated 
the  advantages  of  maximum  likelihood  estimation  procedures 
and  have  shown  that  f.riese  methods  are  superior  to  rvnnnr-r,- 
tial  smoothing  methods  for  both  the  constant  mean  case  and 
the  linear  model,  where  the  forecast  errors  are  assumed  to 
be  normally  distributed  in  both  cases.   No  comparisons  were 
conducted  for  non-normal,  or  auto -correlated  series  in  which 
the  maximum  likelihood  assumptions  are  not  appropriate.   The 
relative  robustness  of  the  least  squares  method  for  various 
series  generation  model  specifications  is  one  of  the  objec- 
tives of  this  investigation.   The  simple  regression  model 
also  provides  a  standard  reference  point  from  which  the  per- 
formance of  exponentially  weighted  forecast  models  may  be 
measured . 
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1 .   Simple  Least-Squares  Forecasting 

a.  Series  Generation  Model 

The  process  observations  are  assumed  to  be  a 
linear  function  of  time,  the  independent  variable.   The 
model  may  be  written 

Yt  ■  h    +  62Xt  +  Et  (A"1) 

where  Y   is  the  random  process  observation  taken  at  time 
X  ,  and  £   is  the  random  component  contained  in  that  obser- 
vation.  It  is  assumed  that  E[£  ]  =  0  and  Var(£  )  =  a2  for 
all  t.   For  this  investigation  3i  is  taken  to  be  zero  in 
the  generation  model,  and  all  observations  are  relative  to 
that  level. 

Consideration  of  the  generation  model  reveals 
that  forecast  errors  will  be  composed  of  two  components, 
errors  of  parameter  estimation  and  the  random  variation  from 
the  true  linear  model,  Equation  (A-l).   As  the  number  of  ob- 
servations increases,  it  may  be  observed  that  the  parameter 
estimates  will  exhibit  smaller  variance  unti]  the  forecast 
error  is  almost  entirely  composed  of  the  random  variation 
inherent  in  the  process. 

b.  Parameter  Estimation 

The  least  squares  estimate  of  $~  in  the  zero- 
intercept  form  based  on  observations  y, ,y7,...,y   is  given 

Z  xiYi 

B2  =  ill (A-2) 

y   x2 
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This  form  is  used  since  all  series  in  this  comparison  begin 

at  the  origin,  X^=  0 ,  Y   =0.   The  reader  unfamiliar  with 

o       o 

the  development  of  this  estimator  is  referred  to  References 
1  and  2.   It  may  be  observed  that  Equation  (A-2)  above  is 
also  the  maximum  likelihood  estimator  of  3?  when  it  is  fur- 
ther assumed  that  e   has  a  normal  distribution.   The  vari- 
ance of  the  estimated  value  of  the  process  is  given  by 

X2 
VAR(Yt)  =  a*        t   * 

Z(X.-X)> 

i=l 

where  a2,  is  the  (constant)  variance  of  the  random  component. 
c.   The  Forecast  Model 

After  parameter  estimation  using  past  observa- 
tions through  time  X   , ,  the  forecast  value  of  the  random 
series  at  time  X   is  given  by 

Pt  =  Yt  -  62Xt.  (A-3) 

Although,  as  noted  in  Reference  1,  the  regression  line  must 
not  be  extrapolated  too  far  beyond  the  range  of  X   values 
this  method  is  well  suited  for  one-step-ahead  prediction. 
In  fact,  it  may  be  shown  [Ref.  1]  that  Equation  (A-2)  is 
the  zero  intercept  minimum  variance  unbiased  linear  estima- 
tor for  £2' 

2  .   Modified  Least->Squares  Estimation 

a.   Consequences  of  Autocorrelated  Disturbances 
One  of  the  crucial  assumptions  when  dealing 
with  least-squares  estimation  models  is  the  serial  independence 
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of  the  disturbance  term  which  is  implied  in  E[e£.']  =  a2l 
and  gives  E  [Y   Yt  +  Sl  =  0  for  all  s^O.   Autocorrelated  dis- 
turbances arise  frequently  in  the  estimation  of  relation- 
ships from  time  series  data,  and  it  will  be  seen  that  almost 
all  the  series  used  in  this  comparison  of  forecast  models 
have  significant  autocorrelation.   Some  mention  of  the  main 
consequences  of  autocorrelated  disturbances  is  therefore  in 
order.   In  the  presence  of  such  disturbances,  the  estimates 
of  3i  and  3?  obtained  by  simple  least-squares  estimation  re- 
main unbiased,  but  the  variances  of  the  estimator  may  be 
large.   This  suggests  the  possibility  of  modifying  the  pro- 
cedure to  reduce  the  variance.   Further,  if  the  usual  least- 
squares  methods  are  applied,  an  underestimate  of  these 
6  s  I  i  r°  s to  t  V3.T i  an ce s  "■  *»  Likely  t ri  o  ecu  i* . 
b.   Series  Generation  Model 

The  same  linear  model  will  be  assumed  here  as 
before  in  Equation  (A-l)  Y   =  3-,  +  32Xt  +  £t  except  now  it 
will  be  assumed  that  the  disturbances  or  shocks  are  no 
longer  independent  random  variables.   Instead,  a  first- 
order  autoregressive  model  is  used  which  is  given  by 

et  =  p^.j  +  5t  (A-4) 

where  {6.}  has  mean  zero,  variance  a|   and  zero  covariances. 
To  give  a  better  basis  for  comparison  of  the  maximum  likeli- 
hood or  least-squares  model  with  the  modified  model ,  the 
same  normal  variates  will  be  used  to  generate  the  sequence 
{6  }  in  Equation  (A-4)  above  as  were  used  to  generate  the 
sequence  {&*.}  in  Equation  (A-l). 
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c.   The  Modified  Forecast  Model 

To  predict  Y  ,  for  a  given  value  of  X  ,  the 
conditional  expected  value  of  Equation  (A-l)  may  be  taken, 
giving 

B[Yt+1|e1,...,et]  =  &i   +  ^2X  +  E  [et  +  l  I  £1 '  '  "  *  '£t] 

-  3X  +  62Xt+1  ♦  pet. 

Here  the  result  involves  the  conditional  expectation  of 
Equation  (A-4)  also.  Substituting  the  value  of  e..  from 
Equation  (A-4)  gives,  after  rearrangement, 

E[  Yt+1|ei,...,et]  =  31(1-P)  +  32(Xt+1-pX2)  +  PYt. 

This  can  be  rewritten  as 

Mv       -  mV  i  c  c-  i  -  ■)  ri-oi+srx      -ox  i 

L  ~  -L       L  '    J.  i,  j.  '«.  "  ' 

If  p  is  known,  Equation  (A-l)  may  then  be  written 

VpYt-i  =  M1"^  +  WpXt-i>  +  V  (A"s' 

This  transformation  satisfies  in  full  the  assumptions  of  the 
simple  linear  model,  and  permits  the  direct  application  of 
least  squares  to  the  transformed  variables  (Y  -pY   , )  and 
(X  -pX   , )  yielding  the  best  linear  predictors.   The  best 

linear  predictor  of  (Yt  +  i"PYt)  is  3-,(l-p)  +  Bo  (^t  +  1  "Pxt) 
where  $,  and  S~  a^e  the  least-squares  estimators  of  the 
parameters  in  Equation  (A-5) .   This  is  equivalent  to  saying 
that  the  best  linear  predictor  of  Y   , ,  conditional  on  past 
observations,  is  $,  (1-p)  +  $2(X      ,-py.   )    +  pY  which  may  be 
written  more  clearly  as 

?t+l  "  K    +  hh  +  l    ♦  p[Yt-(61  +  32Xt)].  (A-6) 
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Analysis  of  this  form  of  the  predictor  provides  some  in- 
sight and  permits  comparison  with  the  unmodified  form  of 
the  model.   By  incorporating  the  available  knowledge  of 
the  autoregressive  structure  of  the  process  a  term  is  added 
to  the  estimate  of  Y   -.  .   This  term  is  the  product  of  p  and 
the  estimated  disturbance  (actually  the  forecast  error)  of 
the  previous  period. 

For  a  further  discussion  of  the  modified  model 
and  the  effect  of  autocorrelation  on  the  linear  model, 
Chapter  7  of  Reference  12  is  recommended. 

B.    THE  SIMPLE  EXPONENTIALLY  WEIGHTED  MOVING  AVERAGE  FORE- 
CAST MODEL 

1 .   Integrated  Moving  Average  (IMA)  Processes 

ijj..u>».v;  cii«u- 1 o  is  oi  uCu  sufficient  reason  l.o  beliGv? 

that  the  more  recent  observations  of  a  process  represent 

that  process  better  than  older  data,  EWMA  methods  which  give 

larger  weight  to  the  more  recent  data  in  constructing  and 

adjusting  the  model  have  been  developed.   It  is  intuitively 

clear  that  such  discounting  of  older  data  is  not  applicable 

when  a  stationary  process  is  observed,  since  even  the  oldest 

observations  are  as  representative  of  the  process  as  the 

most  recent. 

Stochastic  processes  of  the  type  for  which  EWMA 

forecasts  are  appropriate  have  been  described  by  Box  and 

Jenkins  [Ref.  3]  as  integrated  moving  average  processes. 

These  processes  act  as  though  no  fixed  mean  exists,  but  do 

exhibit  homogeneity  in  the  sense  that,  apart  from  local 
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level  and  trend,  one  part  of  the  series  behaves  much  like 
any  other  part.   To  clarify  the  description  of  IMA  pro- 
cesses three  equivalent  forms  of  the  model  may  be  formulated 
[Ref.  3]. 

a.  The  difference  equation  form  may  be  written 
VY   =  (l-3B)e         -Ka<l 

where  V  is  the  difference  operator  and  B  is  the  backwards 
operator.   The  model  may  also  be  written  using  these  defini- 
tions as 

Yt  ■  Yt-i  -  6£t-i  +  V  c15-1) 

b.  The  random  shock  form  follows  from  the  difference 

equation  form  by  use  of  recursive  substitution  for  the  older 

obsei  v  d  i-  i  oris  , 

t-1 
Y,   =  a  T     e,  +  c,  (B-2) 


t  =  a  £  £i  + 


c 


where  y   =  0. 
7  o 


c.   The  inverted  form  follows,  writing  recursively 
in  terms  of  the  observations  rather  than  the  random  shocks, 


Y   =  Y   +  e 
xt    ll  t 


where 

Y 


00 


from  which  form,  the  model  can  easily  be  written  in  recur- 
sive form 

Vi  =  a  Yt  +  BYt  •  (B'3) 
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This  is  recognizable  as  the  recursion  formula  for  the  ex- 
ponentially weighted  moving  average.   It  follows  then  that 
EWMA  forecasts  should  be  appropriate  for  IMA  processes.   A 
demonstration  that  this  is  true  is  contained  in  section  B.3. 
The  form  of  Equation  (B-2)  is  most  convenient  for  generating 
IMA  series,  while  the  form  of  Equation  (B-3)  is  most  often 
used  for  forecasting. 

2 .   The  Series  Generation  Model 

The  specific  EWMA  generating  model  used  in  this 
study  can  be  treated  as  a  simple  random  walk  process,  one 
which  is  reasonably  stable  with  no  trends.   The  model  is 
given  by 

Y   =  Y   +  e 
t     t     t 


v  - 


>z   =  Vi  *  *t 


where  Y   is  the  observed  value,  Y   is  the  true  level  of  the 
process  at  time  t,  y      is  a  slight  random  change  from  the 
prior  process  level,  and  c.  is  a  random  shock  or  superim- 
posed error.   e.  and  y      are  uncorrelated  random  variables 
with  zero  mean  and  constant  variances  V('e  )  and  V(y)  respec- 
tively.  It  will  be  shown  in  section  B.5  that  the  EWMA  fore- 
cast model  supplies  the  optimal  linear  least- squares  predictor 
for  this  generating  model.   A  further  result,  provided  0<a<l, 
is  that  any  generating  process  for  which  the  EWMA  is  optimal 
can  be  represented  by  the  generating  model  above  in  Equation 
(B-4)  even  though  this  model  may  not  necessarily  describe 
the  true  generating  process.   It  must,  however,  describe  the 
external  behavior  of  the  true  generating  process  accurately 
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or  else  the  EWMA  would  not  then  be  optimal.   Stated  another 

way,  if  a  process  cannot  be  represented  by  Equation  (B-4) 

then  the  EWMA  forecast  cannot  be  an  optimal  forecast  rule 

for  that  series. 

3.   Optimal  Properties  of  Exponentially  Weighted  Fore- 
casts 

It  has  been  suggested  intuitively  in  the  preceding 
discussion  of  IMA  processes  that  the  EWMA  recursion  rela- 
tion describes  the  process  exactly.   Following  the  approach 
of  Muth  [Ref .  4] ,  it  can  be  shown  that  for  an  IMA  process 
the  EWMA  equals  the  conditional  expected  value  of  the  pro- 
cess . 

The  EWMA  forecast  results  from  a  model  of  expecta- 
tions which  are  adapted  to  the  most  recent  information  con- 
cerning the  process.   Let  Y   represent  that  part  of  a  time 
series  which  cannot  be  explained  by  systematic  factors  such 
as  seasons  or  trends  in  the  average.   Let  Y   represent  the 
forecast,  or  conditional  expectation  of  Y  which  is  made  at 
time  t-1  on  the  basis  of  available  information  at  that  time. 
Assume  that  the  forecast  is  changed  from  one  period  to  the 
next  by  an  amount  proportional  to  the  latest  observed  error. 
This  implies  that  a  permanent  component  exists  in  every  ob- 
served error  and  that  the  level  of  the  process  subsequently 
reflects  this  component: 

Yt  ■  \-i  +  a<Yt-rVi)   o^1  <B-5) 

This  in  turn  yields  the  EWMA.  formula: 

00 

1  =  1 

21 


Since  $  =  1-a,  the  weights  corresponding  to  previous  values 
of  Y   do  not  introduce  any  systematic  bias. 

Assume  now  that  the  observed  value  of  the  process 
can  be  written  as  a  linear  function  of  the  independent  ran- 
dom shocks : 

Yt  "  I,  wi  Vi  +  st  (B-?) 

i  =  l 

where  the  shocks  are  i.i.d.  with  mean  zero  and  variance  a2. 

If  the  parameters  W.  which  characterize  the  random 
r  1 

process  are  known,  the  expected  value  of  Y   may  be  easily 
found.   If  it  is  desired  to  do  so  one  period  in  advance  of 
time  t  when  e.     is  as  yet  unknown,  the  conditional  expected 
value  of  Y   given  the  past  values  et_-i>    %-?>  ♦••>  maY  De 


v«  «»>  w  \-*>    v-  w    /  _*_  w  . 


t  "  'Ptl't-i-Vr-l  =  .£.  ffi  Vi 

1  =  1        L 


(B-8) 


To  relate  the  regression  function  above  to  the  EWMA 
expression,  Equation  (B-8)  must  be  written  as  follows  in 
terms  of  the  observations,  where  the  values  of  the  coeffi- 
cients u.  must  be  determined. 


CO 


Vr  (B-Q) 


Substituting  for  Y.    with  the  random  shock  form 
and  rearranging  terms  results  in: 


r   =  /.  u.      .  +  Y     W. 

*    j=i  y\  l~i    i=i   i  ^^j 


=  ui  £t-i 


i^l        \  (B-10) 

U-  + 

1 

\ 


'J^^VhIvi  . 
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A  relationship  between  the  parameters  (W.)  associated 
with  the  unobserved  shocks  and  the  coefficients  (u . )  associated 
with  the  past  observations  of  the  process,  is  obtained  by  com- 
paring coefficients  of  their  respective  expressions 

Wl  "  Ul 

i-1 

W.  =  u.  +  V     u.W.     i  =  2,3,4, .. . 
i     i    fa      J  i-j 

To  demonstrate  that  the  EWMA  forecast  evolves  from 
the  above  process,  the  weights  u.  =  a3    ,  j  =  1,2,5,...  from 
the  inverted  form  are  substituted  into  the  above  expression. 
This  results  in  the  system: 

1  (B-ll) 

-;  _  i 
-  _  i      V-*1   -;  _  t 
W.  =  a  3    +  a  L     3J   If,,  ,    i  =  ^,3,%... 

j  =  l         J 

It  can  be  seen  from  the  above  that  W-  -  a  for  all  i  >  1. 

Writing  Y   in  the  random  shock  form  and  substituting 

a  =  W.  results  in  a  form  comparable  to  Equation  (B-2) 

Consideration  of  this  expression  reveals  that  the 
shock  associated  with  each  time  period  has  a  weight  of  unity, 
but  previous  shock  weights  are  constant  with  a  weight  between 
zero  and  one.   This  demonstrates  that  part  of  the  random 
shock  experienced  in  a  period  has  a  permanent  effect,  but 
the  rest  of  the  shock  affects  the  process  only  during  the 
current  period. 
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The  foregoing  assumed  that  forecasts  were  always 
for  one  period  ahead  only,  but  it  can  be  shown  that  the  best 
forecasts  for  all  future  periods  are  the  same.   The  proof 
may  be  found  in  [Ref.  4]  and  is  a  generalization  of  the  above 

00 

approach  where  Y.  „  =  .  Sn  W.  „  z^_        represents  the  forecast 
rr  t,T    i=0   i+T   t-i   r 

T  periods  ahead.   The  unique  solution  for  the  coefficients 
results  in   uT  ,  =  a  $  ,  k  =  0,1,2,...  independent  of  T  as 
asserted.   The  reason  for  this  result  is  that  all  prior 
shocks  are  weighted  equally  and  the  forecasts  then  only  es- 
timate the  permanent  component  of  the  shocks. 

Muth  [Ref.  4]  also  shows  that  the  EWMA  forecast 
rule  is  appropriate  if  the  permanent  and  transitory  com- 
ponents are  independent  rather  than  perfectly  correlated 
o  *   ..<;  i  nr»Ar!  iTio  v^    Tns  *  ■  '  o  '  Qino  cotis  "fc  *m  t  i  n  i"  jig  *  nripr.PVn'iAMt 

case  however  is  constrained  to  be  a  function  of  a  character- 
istic root  of  a  system  of  difference  equations. 

4 .   The  Forecast  Model  as  an  IMA  Process  Generator 

The  familiar  EWMA  forecast  model  is  identical  to  the 
IMA  process  in  Equation  (B-3)  F.+,  =  Y   ,  =  a  Y   +  (l-a)Y  , 
which  may  be  rewritten  in  the  random  shock  or  forecast  error 
form  of  Equation  (B-2)  by  substitution  of  the  error  e.  = 
Y   -  Y  .   This  yields 

Yt+1  =  Yt  ♦  a  et. 

Successive  substitutions  for  the  oldest  estimate  in  terms  of 
the  prior  estimate  plus  the  permanent  component  of  the  ran- 
dom shock  yields  a  form  equivalent  to  Equation  (B-2) 
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Y . , ,  =  Y   +  a  -En  e. ,  and  here  Y   =  0  is  chosen  level  from 
t+1     o     i  =  0   1  o 

which  the  process  is  measured.   The  form  of  Equation  (B-2) 
is  such  that  a  series  of  autocorrelated  forecasts  may  be 
easily  generated.   Following  Muth ' s  interpretation  [Ref.  4], 
this  generation  method  merely  adds  the  permanent  component  of 
the  random  shock  to  the  current  process  estimate  to  yield  the 
next  period  estimate. 

Since  the  next  period  has  an  associated  forecast 
error,  that  error  is  then  added  to  produce  the  desired  ob- 
servation : 


Yt+i  ■  Ft+i +  £t+i  ■ a  X0  £i +  Vi 

which  by  substitution  of  Equation  (B-2)  is  in  a  form  suit 


duie  xui  uii;  _i_ii  »,iiv^  •  or  ex  an 


,J  £>• 


»■»-»  m{ 


Th 


model,  having  no  fixed  mean,  is  free  to  drift  in  a  random 
walk.  It  may  be  observed  that  when  e.  is  reduced  to  zero, 
the  model  no  longer  experiences  shocks  and  becomes  deter- 
ministic  at  the  last  Y.  for  which  e  >  0 .  The  asymptotic 
variance  of  the  generated  series  may  be  determined  by  ob- 
serving from  Equation  (B-2)  with  t-*00  that  E[Y]  =  E['e]  =  0 
and 

2 


a*  =  E[Y2]  -  E[Y]2  =  E 


«   E    e1  Y  - 

i=0 


=  *5L.  a2.  (B-12) 

2-a 


In  generating  the  autocorrelated  series,  the  con- 
stant a  controls  both  the  amount  of  correlation  which  exists 
and  the  series  variance.   When  a=l,  the  series  is  perfectly 
correlated  with  the  random  series  and  has  variance  a2. 
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5.   Parameter  Estimation  Procedures 


To  develop  optimal  parameters  for  a  specific  series, 
consider  a  strictly  stationary  stochastic  process  with 

B[Yt]  =  0,  VAR[Yt]  =  a2,  C0V[Yt,Yt+k]  =  pka2. 

If  the  EWMA  model  is  used  for  forecasts,  then  the  mean  squared 
error  of  prediction  is  by  definition: 

M.S.E.  =  E[Yt-Yt]2 . 
Expanding  this  expression, 

M.S.E.  =  E[Y2]  -  2E[YtYt]  +  E[Y2]  (B-13) 

where  E[Y2]  =  a2.  (B-14) 

Substituting  for  Y   in  the  second  term  above  gives 

-2E[Yt  Y  ]  =  -2E 


Yt  a    £    el   Yt-i 

L    i=0 


=  -2a  £   B1  E[Y   Y    ] 
i  =  0 


(B-15) 


2a  £  3   p.  a2  =  -2aa2 
i=0     1 


Z     B1  Pi  ■ 

i=0 


Expanding  the  third  term  in  Equation  (B-13)  and  substituting 
for  Y   yields 


a2E 


T.     B1  Vi 

i  =  0 


Expanding  the  expression  formally  results  in 


a2E 


Yt  +  3  Yt_1  ♦  32  Y    +  ..- 


=  a2W 


(B-16) 
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where  W  =  E [Yt  +  3Yt_1  +  32Yt_2  +  ...]2.  (B-17) 

Since  {Yt>  is  a  strictly  stationary  process,  W  may  also  be 

written  W  =  E[Yt_1  +  3Y  2  +  S2Yt_3  +---]2-   Rearranging  W 
this  becomes 

W-  E[Yt+B(Yt.1  ♦  BYt.2  ♦  B^Yt.3  ♦  ...)]2 

which  may  now  be  expanded.   Proceeding  formally,  this  pro- 
cedure results  in 

W  =  EfY2.]  +  2E[Yte(Yt_1  +  3Yt_2  +  32Yt_3  +•••)]  +  32W 

which  may  be  written  more  briefly  as 


00 

m 

+    2E 

E 

i=l 

31Y+.    Y.     - 
t      t-i 

+    32W 

- 

«. 

+       /T2 

00 

i=l 

l 

IV  -   u   + 

K 

Here  u  is  defined  as  the  sum  of  the  first  two  terms.   Solv- 

u 


ing  for  W,  this  becomes  W  : 
Equation  (B-16)  results  in 


1-32 


Again  substituting  W  into 


a2W 


au 
1  +  3 


(B-18) 


Combining  Equation  (B-14) ,  Equation  (B-15) ,  and  Equation 
(B-18)  in  Equation  (B-13)  yields 


M  .  S  .  E 


2  62    2a 

2-a    2- 


a  L-j  "i 


(B-19) 


i=l 


which  is  the  mean  squared  error  of  one -step -ahead  prediction 
as  a  function  of  the  smoothing  constant  and  the  autocorrela- 
tion of  the  series.   This  is  a  result  obtained  by  Cox,  p.  415, 
Ref.  5.   For  a  Markov  series  with  exponential  autocorrelation 
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Pk  =  p   ,  k  >  0 

where  k  is  the  lag  between  observations,  the  mean  squared 
error  for  one-period  ahead  forecasts  becomes 


m  q  p   -  2q2C1'P) 

M*b-h-  (l-3p)(l+S) 

The  MSE  is  minimized  for  a  given  p  by  an  EWMA  predictor 

which  has  parameters  / 


(B-20) 


opt 


/1-P 

<  —zr-   or  a 
^  2p 


opt 


3p-l 
2p 


3" 


(-1 


p  <  1) 

(B-21) 
P  5  1/3) 


These  results  are  obtained  by  equating  to  zero  the  derivative 

of  Eauation  (B-20).   The  corresponding  mean  squared  error  of 

prediction  for  the  EWMA  with  optimal  parameters  is  obtained 

from  Equation  (B-20)  by  substitution 

/ 


M.S.E.  = 


8o2p(l-p) 
(1+P)2 


(1/3  <  p  <  1) 
C-l<p<  1/3) 


Analysis  of  this  result  discloses  that  when  p  >  1/3  it  is 
optimal  to  predict  the  observed  series  values  using  a  larger 
a  value  since  there  is  sufficient  correlation  between  suc- 
cessive observations  to  make  this  procedure  advantageous. 
When  p  <  1/3  the  EWMA  should  attempt  to  predict  only  the 
mean  value  of  the  process  since  so  much  uncertainty  exists 
concerning  the  next  observation.   When  the  optimal  a   equals 
zero,  the  MSE  of  prediction  equals  the  variance  of  the  time 
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series.   When  p=a=l  perfect  correlation  permits  exact  fore- 
casts and  the  M.S.E.  is  zero. 

The  preceding  results  are  obtained  by  Cox  [Ref.5] 
who  also  developed  tables  showing  the  relationships  between 
p,  a    ,  and  the  MSE/a2  ratio.   It  is  interesting  to  note 
that  as  one  predicts  more  than  one  step  ahead,  the  critical 
value  of  p  increases.   One  conclusion  drawn  from  Cox's 
tables  is  that  MSE  is  not  sensitive  to  the  choice  of  a 
smoothing  constant,  and  although  the  optimal  a  may  be  zero, 
it  costs  little  to  use  a  =  0.1  as  insurance  against  a  possi- 
ble change  in  the  mean  level  of  the  process.   This  insensi- 
tiveness  probably  accounts  for  the  success  of  the  exponential 
smoothing  model  proposed  by  Brown  [Ref.  6]  for  which  he  uses 
cir»" i ~* i " "  1  me  t hods  of  ar ame te r  estimation  ox  3  rul e  of  t numu 
a=0.1  for  general  use. 

An  alternative  approach  to  optimal  parameter  deri- 
vation for  the  EWTIA  forecast  model  used  by  Harrison  [Ref.  7] 
may  be  taken  by  considering  the  generating  model  in  Equation 
(B-4) .   Recalling  that  the  one  step  ahead  forecast  error 
e   =  Y   -  F  =  Y   -  Y  ,  and  that  the  forecast  model  is 
Y   =  Y  _,  +  ae  ,  a  difference  equation  may  be  written  to  ex- 
press the  one  step  ahead  forecast  error 

et  +  1  =  (l-cOet  +  ct  +  1  -e  t  +  yt+1.       (B-22) 

Then  the  expectation  of  the  product  of  Equation  (B-22)  with 

both  e.  -  and  e.  yields  the  variance  and  first  covariance 
t +  i      z 

respectively  of  the  one-step-ahead  forecast  errors 
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V  =  (l-a)C  +  (1+a)  V(e)  +  V(y)  (B-23) 

C  =  (l-a)V  -  V(e). 

Solving  these  two  equations  for  V  yields 

v  =  2°  vtf  *   VM  .  (B.24) 

Differentiating  this  with  respect  to  a  gives  the  minimum 
variance  parameter 

a  =  Q1<k£.    -    1   where  R  =  V|iJ.  .  (B-25) 

Substituting  for  a  in  Equation  (B-24)  it  is  found  that  the 

minimum  variance  is  V  =  =.  *• £j)  and  the  covariance  is  0.   The 

1  -a 

error  covariances  may  easily  be  shown  to  be  zero  by  taking 
the  expectation  of  the  product  of  Equation  (B-22)  with  et_- 
Mi  ■*-  jj  -  ^  9    \1 1  covaria.)  - . .  < •  .-.-»■»->  7 e to   so  it  '  ->  1  1  .-.-.t c  *■  h  -  *- 

the  predictor  is  optimal.   Although  this  approach  was  used 
by  Harrison  [Ref.  7],  he  then  recommended  that  because  of 
the  usual  shortage  of  data,  general  robustness,  and  pre- 
cision, for  short-term  forecasting  the  optimal  parameter 
should  be  derived  from  a  simulation  of  the  predictor  on  the 
original  series  data.   This  has  also  been  the  suggestion  of 
Cox,  Brown  and  others.   It  appears  that  even  though  the  op- 
timal parameters  may  be  determined  for  particular  series  on 
a  theoretical  basis,  it  is  faster  and  possibly  about  as  sat- 
isfactory in  terms  of  forecast  results,  to  determine  the 
parameters  empirically.   The  insensitiveness  of  the  simple 
EWMA  model  to  the  parameter  value  chosen  has  made  parameter 
selection  an  area  of  rapidly  diminishing  return  for  the  ef- 
fort expended. 
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C.    COX'S  MODIFIED  EXPONENTIALLY  WEIGHTED  MOVING  AVERAGE 
FORECAST  MODEL 

1 .   Series  Generation  Model  for  the  Exponential  Auto- 
correlation Function 

Cox  [Ref.  5]  has  shown,  as  noted  in  Section  B.4, 

that  the  exponentially  weighted  moving  average  is  an  optimal 

forecast  method  for  time  series  having  an  autocorrelation 

coefficient  with  lag  k  of  the  form  p(k)  =  3  .   A  recursion 

relation  which  is  identical  to  the  relation  for  exponential 

smoothing  and  which  generates  such  a  time  series  is  shown 

by  Naylor  [Ref.  8].   This  recursion  relation  is  given  by 

Y0  -  (l-«e0 

Yt  =  6Yt.1  ♦  (1-B)et 

where  {e.  \      are  mutually  independent  variables  with  zcxc 
mean  and  variance  a2. 

£ 

The  method  used  in  this  investigation  to  produce 
the  exponential  autocorrelation  function  requires  that  the 
e   be  uniform  random  numbers  in  the  interval  [-A,A].   This 
choice  of  z      results  in  exponentially   autocorrelated  vari- 
ates  Y   which  have  zero  mean  and  variance  equal  to 

2    1 "  3    2     ot    2 

where  a,  the  smoothing  constant,  equals  1-3.   That  the  re- 
cursion relation  above  gives  the  desired  result  may  be  seen 


t-1 


by  using  recursive  substitution  so  that  Y   =  a  .|n  3  £t_-+3tY 
or  in  terms  of  the  random  variable  alone 


t  *  °  A 


-1     • 

3J  e.  .  +  <x3  e  . 

j  =  0    ^ 
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The  mean  of  the  generated  time  series  follows  immediately 
since  E[Y]  =  E[e]  =  0  and  the  variance  is 

a*  -  E[Y2]  -  E[Y]2  =  E[Y2]. 

Squaring  the  recursion  relation  and  taking  expected  value 
gives  a2  =  62E[Y2_1]  +  2aSE[£  Y   ^  +  a2E[e:2]  which  gives, 
since  e.  and  Y   ,  are  independent  with  mean  zero,  the  result 

(l-$2)a2  =  a2a2. 

y      e 

Therefore,  a2  =  -r—t ?   a2,  which  is  equivalent  to  the  variance 
y   1  +  3  e 

expression  above.   When,  as  in  this  investigation,   the  c 
are  uniformly  distributed  on  the  range  [-A,A]  the  variance 
becomes 

(  9  A-)  2         „,A2 


y   1+3   12     -HJ-  +  £J 

The  autocorrelation  for  two  numbers  which  are  k  observations 
apart  in  the  time  series  is 

♦  00  ■  E[YtYt  +  k]-E[Yt]E[Yttk]  =  £-   BV 

which  implies  that  the  autocorrelation  coefficient  is 

p(k)  =  ^  K'2J    =  3   which  is  the  desired  result.   This  result 

°y 

is  obtained  by  making  a  transformation  on  the  summations  in 

the  product  Y.Y. _,  which  makes  use  of  the  stationarity  of 
r        t  t+k  ' 

the  series  c.    and  then  taking  the  expected  value  of  the  pro- 
duct Y.Y^,,  while  both  are  expressed  in  terms  of  the  random 
t  t  +  k  v 

shocks.   The  transformation  referred  to  above  is 


t  +  k  +  1  .  ,     t-1  j+k 

t+h  =  a  X0  6J  £t+k  -  j  +  **       £5a  .?0*et+k-(j+10 
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+    a3         £_k   =    3 


j  =  0 


sjet-j  +  agt£-k 


which  has  the  same  expectation  as  3  E[Y  ].   Therefore, 

J2 

y 


E[Yt  Yt  +  h]    3k  E[Y2]  =  3ka2    <|>(k)  from  which  the  result 


follows  immediately. 

2 .   Wiener's  Optimal  Linear  Predictor 

Cox  [Ref.  5],  after  developing  the  optimal . smooth- 
ie 
ing  parameters  for  exponentially  correlated  (p,  =  p  )  time 

series  which  are  shown  in  Section  B.4,  sought  to  improve  the 

model  so  that  it  did  not  lag  behind  when  used  to  forecast 

series  containing  an  increasing  mean  value.   He  made  use 

of  Wiener's  optimal  linear  predictor  for  this  same  series 

which  is  given  by: 

(C-l) 

Yt(opt.)  =  ph(Yt-u)  +  y  -  ph  Yt  +  (1-Ph)y 
where  h  is  the  number  of  periods  ahead  for  which  a  forecast 
is  desired,  and  p  is  the  autocorrelation  coefficient  of  the 
series  to  be  forecast.   The  Wiener  predictor  has  a  mean 
square  error  of  forecast  which  may  be  written 

M.S.E.W  =  (l-p2h)a2. 

The  Wiener  predictor,  although  shown  optimal  when 
all  assumptions  were  met,  had  two  properties  which  made  it 
objectionable  for  practical  use.   First,  the  parameters  y 
and  p  are  assumed  known.   This  can  be  overcome  if  sufficient 
past  observations  are  available  for  use  in  estimating  y  and 
p,  since  the  mean  squared  error  is  asymptotically  unaffected 
by  the  substitution  of  unbiased  estimates  for  these  parame- 
ters.  The  more  serious  objection  to  the  predictor,  Equation 
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(C-l)  however,  is  that  it  becomes  biased  if  the  mean  shifts, 
correspondingly  increasing  the  mean  square  error  of  fore- 
cast . 

3.   The  Modified  Form  of  the  Model 

Cox  extended  Equation  (C-l)  by  substituting  Y   for 
u  so  that  the  predictor  would  follow  shifts  in  the  process 
mean.   Equation  (C-l)  then  becomes 


Y 


m 


=  Ph  Yt  +  (l-ph)Yt  (C-3) 


or  substituting  for  Y  ,  it  may  be  written 

Ym  =  (a+gph)  Yt  ♦  3  (l-ph)Yt.r  (c.4) 

Inspection  of  these  two  equations  reveals  that  as  a->0  the 


m.Jb  u 


f  Equation  (C-3)  approaches  Lha-  of  the  Wien«*i  Die 


dictor,  Equation  (C-2),  since  Equation  (C-4)  collapses  to 
Equation  (C-l).   Also  note  that  when  p  f    0  the  model  Equa- 
tion (C-3)  is  no  longer  an  exponentially  weighted  moving 
average.   It  is  instead  a  moving  average  in  which  the  weights 
of  past  observations  decrease  exponentially,  but  in  which 
the  current  observation  receives  a  weight  which  is  not  a  mem- 
ber of  the  same  geometric  series.   The  weights  appearing  in 
the  expression  for  Y   in  Equation  (B-3)  sum  to  one,  as  do  the 
coefficients  of  Y   and  Y   in  Equation  (C-3),  thereby  main- 
taining the  desired  average.   Another  observation  concerning 
Equation  (C-4)  is  that  as  the  number  of  periods  ahead  for 
which  we  wish  to  forecast  increase,  the  modified  predictor 
tends  to  reduce  to  the  simple  EWMA  form. 
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4 .   Parameter  Estimation 

The  mean  squared  error  of  Equation  (C-3)  is  given 

by 

2aHi-ph)(i-P32+  3ph  -  6ph+1) 
M'S-E-  =  — (l-Bp)(l  +  3) (C"5) 

which  reduces  to  the  mean  squared  error  of  the  Wiener 

predictor  Equation  (C-2)  when  $=l=h.   The  values  selected 

for  a  and  3  in  Equation  (C-4)  will  necessarily  reflect  a 

compromise  between  the  desire  for  a  minimum  mean  square 

error  and  a  desire  for  protection  against  a  change  in  mean 

process  value.   As  Cox  has  shown  [Ref.  5]  however,  when  the 

optimal  value  of  a  is  0  there  is  insignificant  change  in 

mean  squared  forecast  error  for  the  EWMA  when  a  is  made 

larger  ud  to  the  range  0.1  or  0,15, 

D.    THE  HOLT-WINTERS  AND  THEIL-WAGE  TWO-PARAMETER  EXPO- 
NENTIALLY WEIGHTED  MOVING  AVERAGE  (EWMA)  LINEAR  GROWTH 
FORECAST  MODELS 

Holt  [Ref.  9]  as  further  discussed  in  Winters  [Ref.  10] 
proposed  a  forecasting  model  for  time  series  exhibiting  lin- 
ear growth  which  was  a  simple  extension  of  the  EWMA  method 
already  applied  to  time  series  with  constant  mean.   Although 
Winters'  development  explained  precisely  how  the  method 
worked,  he  has  been  criticized  by  Theil  and  Wage  [Ref.  11] 
and  others  for  failing  to  explicitly  justify  the  method. 
The  rationale  for  the  criticism  seems  to  be  that  Winters 
failed  to  formulate  an  explicit  stochastic  model  as  a  basis 
for  the  forecasting  method.   Instead,  Winters  used  a  com- 
pletely empirical  method  for  selection  of  parameters.   It 
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must  be  observed,  however,  that  Winters  utilized  time  series 
of  actual  data  from  three  dissimilar  processes.   His  ap- 
proach is  probably  representative  of  those  currently  used 
when  real  data  must  be  forecast.   For  real  data,  when  the 
model  is  in  fact  unknown,  Winters'  approach  implicitly  as- 
sumes various  underlying  models  and  his  choice  of  parameters 
which  gave  the  least  forecast  variance  is  his  implicit  spec- 
ification of  the  underlying  model.   For  the  purpose  of  this 
investigation  the  criticism  raised  by  Theil  and  Wage  is 
considered  valid,  and  their  generating  model  will  be  used 
along  with  a  discussion  of  their  determination  of  optimal 
parameters.   It  may  be  added  that  Harrison  [Ref . . 7]  has 
postulated  yet  another  generating  model  for  the  Holt-Winters 

1   '         ■  -      -  --•  1  '.  •    r   r [.Unlitlw       -^ *-V»a       TVio^  1        '-'••  eta,      mnJol 

and  has  also  derived  optimal  parameters. 

1 .   The  Linear  Growth  Series  Generation  Model 

The  Linear  growth  model  proposed  by  Theil  and  Wage 
[Ref.  11]  is  given  by 

Yt  -  Yt  +  et 

Yt  =  Yt.1  ♦  bt  CD-I) 

bt  =  bt.!  ♦  «t  . 

Here  z.    is  again  a  random  shock  or  residual  by  which  the  ob- 
servation Y   differs  from  its  mean  value  Y   and  the  change 
attributed  to  trend  or  the  slope  b   is  changed  by  a  random 
amount  6  .   The  random  variables  e.  and  6   are  assumed  to 
have  zero  mean,  constant  variance  and  zero  covariances. 
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2 .   The  Forecast  Models 

a.   The  Holt-Winters  Model 

The  Holt-Winters  forecast  model  is  given  by 

Ft  =  Y   +  kbt  (D-2) 

where  k  =  1  for  one-step-ahead  forecasts.   This  is  the  orig- 
inal exponential  smoothing  model  in  which 


and 


Yt  =  a  Y   +  *(?  *   bt_x)  CD- 3D 


bt  =  a'  (Yt  -  Yt_1)  +  3,bt_1.  (D-4) 


The  first  parameter  a  in  Equation  (D-3)  is  the  smoothing  con- 
stant for  the  process  level.   Note  that  Equation  (D-3)  is 
identical  to  (B-3)  except  for  the  addition  of  the  slope  to 
the  previous  process  level.   The  second  parameter  a'  in 
Equation  (D-4)  is  the  smoothing  constant  for  the  exponentially 
smoothed  slope  estimate.   Holt  and  Winters  place  no  constraints 
on  the  relation  between  a   and  a'  as  do  Brown  [Ref.  6]  and 
Theil  and  Wage  [Ref.  11]. 

b.   The  Theil -Wage  Model 

Theil  and  Wage  have  suggested  that  Equation  (D-3) 
neglects  available  information  and  propose  to  replace  it  with 

Yt  =  a  Yt  +  PCY^  +  bt)  (D-5) 

which  they  consider  a  more  simultaneous,  rather  than  a  re- 
cursive, approach.  Taking  Equation  (D-4)  and  substituting 
into  (D-5)  they  obtain 
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y\     y\ 


Y^  = 


t  =  a  Yt  *  a[Yt.1  +  a'(YfYt.1)  +  S'b^] 

=  3a»Yt  +  aYt  +  33'  (Y^  +  b^) 
and  therefore,  solving  for  Y 

a    v      33' 


Y   =  — - —  Y   + 
t    l-3a'    t    l-3a' 


[Yt.2  ^  bt-1]  (D-6) 


which  is  of  the  same  general  form  as  Equation  (D-3),  but 
the  smoothing  and  discount  parameters  are  changed  to  obtain 
all  the  information  contained  in  the  observations.   Note 
that  when  a1  =  0,  3'  =  1.   Equation  (D-6)  becomes  identical 
to  Equation  (D-3),  because   when   .  a1  =  0,  correspondingly, 
the  slope  is  constant  and  therefore  makes  no  contribution. 
3 .   Parameter  Estimation  Procedures 
a.   The  Holt  Winters  Model 

The  empirical  parameter  selection  method  used 
by  Winters  [Ref.  10]  was  a  steepest  ascent  search  for  the 
optimal  parameters.   While  this  may  be  appropriate  for 
critical  forecasts,  it  is  recalled  that  the  objective  of 
this  comparison  is  finding  a  forecast  technique  suitable 
for  routine  forecasts  of  many  items.   Clearly,  a  search  is 
inappropriate  for  each  of  them.   Instead,  use  will  be  made 
of  Winters'  findings  concerning  the  best  composite  parameter 
values  obtained  during  his  investigation. 

The  evaluation  method  used  [Ref.  10]  to  arrive 
at  the  best  composite  values  was  to  express  the  forecast 
error  standard  deviation  for  each  parameter  combination  as 
a  percentage  above  the  minimum  standard  deviation  achieved 
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each  product,  and  to  add  across  the  three  products  for  each 
parameter  combination.   The  best  composite  rating  of  24%  was 
achieved  by  parameter  set  (.2,  .1)  for  (a, a').   These  param- 
eters will  be  used  in  this  forecast  comparison  with  the 
observation  that  they  may  not  be  the  best  parameters  for 
the  particular  series  used  here.   As  noted  above,  however, 
for  mass  forecasts  these  parameters  must  suffice.   The  ro- 
bustness or  insensitivity  of  the  model  to  various  series 
should  become  apparent  regardless  of  the  parameters  chosen. 

Although  neither  Holt  nor  Winters  gave  a  pro- 
cedure for  optimal  parameter  estimation,  Harrison  [Ref.  7] 
develops  optimal  parameters  for  a  specific  linear  growth 
model 


Y   =  Y   +  e 
t    t    t 


Yt  ■  Tt-i  +  bt  +  yt 
bt  ■  bt-i  +  6t 

for  which  Holt's  predictor  gives  the  least  mean  squared 
error.   Following  Harrison's  procedure,  the  difference  equa 
tion  form  of  the  Holt-Winters'  Predictor  given  by 

A  A  /\ 

Yt+1  "  Yt  +  bt 

Yt  =  Yt-1  +  bt-l  +  aet 

bt  =  bt_1  ♦  a'et 


may  be  written  more  compactly  as 
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et+1  =  aet  +  be^  -  Et+1  (D-7) 

where 

a  =  2  -  a  -  a'  (D-8) 

b  =  -(1-a)  =  -3 


Et+1  =  £t  +  l  "  2et  +  et-l  +  Yt  +  1  "  Yt  +  5t+l 


Expanding  Equation  (D-7)  gives 


(D-9) 

t+1' 

(D-10) 


where 


and  where 


i  =  0 


Wk  "  a  Wk-1  +  b  Wk-2 


o   ' '  "i   "i   'l2   a'  ": 


? 


When  A.,  and  A-  are  roots  of  the  equatio 


n 


z   -  az  -  b  =  0 


it  is  in  general  true  that 

A  k  -  A  k 
1      2 

k-1     Ai  "  x2   ' 

From  Equation  (D-10)  and  Equation  (D-ll)  it  follows  that  the 
error  variance  of  forecasts  is  given  by 

CO 

V(e)  =   £  [(A2Wk)2V(c)  +  (AWk)2  V(6)  +  W2  V(6) ] . 
k=0 

This  variance  error  tends  to  a  limit  if  |W,  J<  1,  which  im- 
plies that  | A, |  and  | A~ |  are  less  than  unity  also.   For  sta- 
bility of  the  expression  it  follows  that 
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a  ±  (a2  +  4b) 
2 


<  1. 


When  a  substitution  is  made  for  a  and  b  the  stability  con 
ditons  become 


ft  .   ,  .   (g+g')2 
0  <  a   < j — - — 


2a  +  a1  <  4 


(D-12) 


(D-13) 


The  optimal  parameters  must  satisfy  these  conditions,  and 
since  they  will  be  determined  as  functions  of  the  variances 
V(e) ,  V(y)  and  V(6)  the  parameters  are  further  restricted  by 

0  <  V(e) ,  V(y) ,  V(6)  <  V(min) . 


Ill  JlS      UHU  HCS       I  i  i  a  i.       LilC      0  j  >  i,  J.  Ilia  J.      j  >  a.  V  a.iii\_  C  S  i  S      W  j.  a  x      u  v..      v_liiiuaj  ii. 

within  the  subset  of  the  stability  region  defined  by 


0  <  0.  <   Ca+a')^   <  x 

2  +  a+  a 


(D-14) 


0   <  a  <  1 

To  determine  the  optimal  parameters,  Equation  (D- 7)  can  be 
multiplied  by  both  et+1  and  e  ,  giving  two  equations  which 
may  be  written 


2-^2 


(l-b")S"  =  a(l+b)c1  +  bd2  +  dQ 


aS   =  (l-b)c,  -  d. 


(D-15) 


where  d.  =  E[e..E  ]}  s   and  c1  are  the  variance  and  first 

2 
covariance  or  the  errors,  C.   Eliminating  S   from  the  equa 

tion  gives 
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(l+b)[(l-b)2  -  a2]cx  =  (l-b2)d1  +  abd2  +  adQ  (D-16) 

It  is  shown  in  Appendix  1  to  Ref.  7  that  the  constants  a,  b, 
and  V  satisfy  the  stability  conditions  for  the  following  when 
defined  in  such  a  way  that 

bV  =  V(e)  =  d? 

L  (D-17) 

aV  =  (4-a)V(e)  +  V(y)  =  ~\ 

V  =  (6-4a+a2+b)V(e)  +  (2-a)  V(y)  +  V(S)  =  d 
d.  =  0  for  I j I  >  2. 

Given  these  values,  the  right  hand  side  of  Equation  (D-16) 
simplifies  to 

V[-a(l-b2)+ab2  +  a]  =  0.  (D-18) 

Referrine  +  o  ~~-t'<-'~"    fn  81  and  Eauation  •rr''-r'-"  and  substitu- 
ting for  a  and  b  in  the  above,  making  use  of  the  stability 
restrictions  in  Equation  (D-14),  it  is  seen  that  the  coeffi- 
cient of  c,  in  Equation  (D-16)  is 

act'  [4-2a-a'  ]  >  0  . 

Since  Equation  (D-16)  equals  zero  by  Equation  (D-18),  it 

follows  that  c,  =  0,  and  from  Equation  (D-15)  and  (D-17)  it 

2 
follows  that  S  =V.   Since  all  other  covariances  can  be  shown 

to  be  zero  by  multiplying  Equation  (D-7)  by  et_-,  i=l,...°°, 

it  follows  that  the  predictor  is  optimal  when  the  parameters 

satisfy 

V(e)  =  (l-a)V 

V(y)  =  (a2+aa'-2a')V 

V(6)  =  a,2V. 
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Here  V  is  the  minimum  variance  of  forecast  error.   The  prob- 
lem remains,  however,  of  estimating  the  variances  of  the 
random  components  of  the  process.   Harrison  suggests  the  use 
of  Serial  Variation  Curve  analysis  of  the  observed  data  to 
assess  the  suitability  of  the  model  for  forecasting  the  data 
and  to  provide  an  estimate  of  the  variances  needed  to  select 
the  optimal  parameters.   By  examining  the  first  differences 
of  the  data,  Harrison  [Ref.  7,  p.  833]  states  that  the  Serial 
Variation  Curve  of  the  first  differences  is  expected  to  be  a 
straight  line  with  gradient  V(5)  for  i>2,  or 

E[AYt-AYt_1]2  =  4V(e)  +  2V(y)  +  i  V(6) ,  i>2 
=  6V(e)  +  2V(y)  +  V(6)   ,  i=l. 


this  method  for  determining  the  variances  has  a  large  error, 
and  concludes  here,  as  before  in  his  derivation  of  parameters 
for  the  simple  EWMA  model,  that  the  optimal  parameters  are 
again  best  determined  by  simulation  (empirically)  using  the 
actual  data.   For  purposes  of  this  comparison,  then,  the 
parameter  set  recommended  by  Winters  is  still  retained, 
b.   The  Theil-Wage  Model 

To  determine  the  adaptation  parameters  a  and  a' 
which  minimize  the  mean  square  forecast  error  the  forecast 
error  must  first  be  expressed  as 

et  -  Y   ♦  b   -  Y   -  Yt  ♦  bt  -  (7t.1+bt.1+6t+et) 
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where  the  value  of  Y   in  Equation  (D-l)  is  substituted  into 
Equation  (D-2).   This  may  be  written  in  more  concise  nota- 
tion as 

et  ■  At  +  Bt  -  £t  -  «t  CD-19> 

where 

At  ■  ?t  -  Vi  •  Bt  ■  bt  -  bt-i. 

To  insure  that  the  notation  is  clear,  recall  that  the  fore- 

A  A 

cast  of  Y.  is  F^  =  Y.  +  b.  which  can  be  considered  as  an 
t     t     t    t 

estimator  of  Y.  expressed  as 

*t     t-1    Dt-1     t     t 
From  Equation  (D-19)  it  can  be  seen  that  the  forecast  error 
is  the  sum  of  three  terms: 

the  sampling  error  (A,)  of  the  mean  process  level  at  time 
t-1, 

the  sampling  error  (B  )  of  the  slope  change  at  time  t-1, 
and  a  disturbance  combination  associated  with  the  t^h  period. 
Using  the  forecast  model  Equation  (D-3)  and  Equation  (D-4), 
the  sampling  errors  can  be  eliminated  successively  from 
Equation  (D-19)  and  it  can  then  be  written  as 

Yt  "  Yt  -  CVi-Yt-i)-<Vi-V7t:-i)  ■  "aet-r 

Further  rearrangement  reveals  that  A.  can  be  written  as 

At  =  (1-cO  (At_1  +  Bt_1)  +  aef  -  (l-o)6t.        (D-20) 
The  same  procedure  permits  B   to  be  written 

Bt  =  -cxa'At_1+(l-aa,)Bt_1  +  aa,et-(l-aa,)5t.    (D-21) 
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Finally  these  results  may  be  expressed  in  vector  form  as 


A. 


uBt. 


=  p 


"\-f 

~£t  ' 

-Vi- 

+Q 

-6t- 

(D-22) 


where 


ra       3-1 


P  = 


,  Q  = 


a  -  $  I 


=  aa 


=  1-0 


Analysis  of  the  above  reflects  that  an  optimal  a  and  6  imply 
an  optimal  a'  also.   By  successive  elimination  of  the  vec- 
tor 

At-i 

L»t-H 


in  Equation  (D-10)  that  equation  then  may  be  written 


■v 


V 


r-A 


=  P 


n 


L 


B 


t-n 


t-n 


n-1 
+  V     PkQ 

k=0 


t-k 


L6 


t-k 


(D-23) 


When  a  and  a'  are  positive  (the  latent  roots  of  P  are  within 
the  unit  circle) ,  the  first  term  on  the  right  of  Equation 
(D-23)  converges  to  zero  when  n->°°.   Combining  this  equation 
with  Equation  (D-19)  the  forecast  error  may  be  written 


-et-k 

6 


£t*l  "  6t*l 


et  -  [1  1]  £   PkQ 

On  expanding  and  taking  the  expectation  formally  the  mean 
square  error  is  seen  to  be 


(D-24) 
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MSE  =  [1  1] (QDQ'+PQDQ'P,+P2QDQ,P'2+. • •) [*]  +    o2    +    o 
where 


(D-25) 


D   = 


'o2        0 


0   a 


6J 


To  simplify  the  MSE  expression,  the  term  in  parenthesis  may 
be  grouped  as   S  =  QDQ ' +PQDQ ' P ' +P2QDQ ' P • 2  + • • •  and  PSP'  = 
PQDQ'P'  +  P^QDQ'P •+• • •  may  be  subtracted  from  both  sides  of 
Equation  (D-25)  giving 


S  -  PSP'  =  QDQT 


(D-26) 


The  result  can  then  be  regarded  as  a  set  of  linear  equations 
in  the  three  elements  S, , ,  S, ? ,  S?~  of  S  since  it  is  seen 
that  S  is  symmetric.   Equation  (D-26)  can  then  be  written 
in  explicit  form  as  follows 


bll    12 


S12   S22J 


r  3   6  "irSi;L   S12 


^12   ^22 


rB   -6 


3   ev 


ret   -  31 

e  -  e' 


ra2 


6  J 


a 


-3 


which  after  rearrangement,  has  the  solution 


'11 


12 


S13J 


a9(4-2a-9) 


r(2-a8)  e 


-36 


2326 


32(l+3) 


~a2a2  +  32o2 


a6a2  +  3  9' a2 

E         0 

_S2a|  +  9'2a|_ 


46 


(2a(l+3) -9)9  3(a(l+3)-9) 


_(l+3)92   2(a2-2a+9)9   a2  (1  +  3)  +293-1 


(D-27) 


Returning  to  Equation  (D-26)  it  is  noted  that  S  must  be  pre- 
multiplied  by  [11]  and  post  multiplied  by  [11]'.   This  sug- 
gests the  form  S, ,  +  2S, »  +  S~~  so  Equation  (D-27)  is 
premultiplied  by  [121].   The  product  of  this  vector  with  the 
3x3  matrix  on  the  right  side  of  Equation  (D-27)  results  in 
[26  20   1+6] . 

Combining  this  with  Equation  (D-26)  the  mean 
square  prediction  error  is  found  to  be 


MSE  =  S-..  +  2S19  +  S99  +  a2  +  ol    =  a2  ^Alal    *\ 

11      12     22  e  6     e  a(2(l+3)-8) 

+  r2      (1+B)      _  ai  (g2(l  +  3)  +  4aQH-202)    r   ?jn 

C6  a6(2(l+3)-e)   "  ae  a8(2(l+S)-6)      eU  Z8J 


where 


g2  =  a2  /a2  (D-2yj 


the  ratio  of  the  slope  change  variance  to  the  additive 
error  term  variance.   The  nature  of  the  random  processes 
normally  encountered  suggests  that  g  is  often  a  small  num- 
ber.  Now  that  the  MSE  has  been  obtained  in  fairly  simple 
form,  the  optimal  value  of  0  may  be  found  by  taking  the 
derivative  of  Equation  (D-28)  with  respect  to  0  and  equating 
the  numerator  to  zero.   Solving  the  resulting  quadratic 
equation  in  0,  the  optimal  0  value  may  be  written 

0  =  h2(l+3)  (D-30) 

where 

h2  =  -1/8  g2  +  |g(l+l/16g2)^  •        (D-31) 

Equation  (D-30)  gives  the  value  of  0  which  will  minimize  the 
mean  square  forecast  error,  given  a  specific  a.   Recalling 
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that  6=aa'  it  is  seen  that  G  is  a  decreasing  linear  function 
of  a.   As  a  matter  of  convenience  it  has  been  found  easier  to 
work  with  h  than  with  g  so  solving  Equation  (D-31)  for  g  re- 
sults in 

g2  =  4hV(l"h2)  .  (D-32) 

Substitution  of  the  6  value  of  Equation  (D-30)  into  Equation 
(D-29)  gives 


MIN  MSE  -  a2  Cg'+4h*+2h2 (2-h2) a) 
MIN  MSE    a£  -J,h2(2-h2)a(l  +  Bl 


=  a 


2  4h2/(l-h2)+2a 
e     a(l+3) 


(D-33) 


Differentiation  of  Equation  (D-33)  with  respect  to  a  and 

putting  the  result  equal  to  zero  gives  a  quadratic  equation 

in  a.   The  results  are  found  to  be 

(D-34) 


a  = 


2h 
1+h 


,  3 


1-h 
1  +  h 


a'  =  h 


2h2 
1  +  h 


3'  =  1-h 


A  substitution  of  the  results  of  Equation  (D-34)  into  Equa 
tion  (D-33)  gives 


MIN  MSE  =  a2   ±±£  =  a2/3. 

a.e      e  1_h 


(D-35) 


This  result  has  been  tabulated  in  Reference  11  for  various 
values  of  g2 .   For  an  illustration  of  the  method,  however, 
the  optimal  parameters  for  the  Theil-Wage  forecast  model 
will  be  computed  here,  assuming  that  a2  and  ol   of  Equation 
(D-l)  are  25/3  and  4/3  respectively. 

e.  -  u(-5,5)=J>a2  =  (5  +  5)2/12  =  25/3 
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6t  ~  u(-2,2)=£>a2  -  (2  +  2)2/12  =  4/3 

from  which 

g2  -  *\    I    °*£    -    -16 

and 

g  =  0.4 

from  Equation  (D-30).   From  Equation  (D-13)  it  follows  that 

h2  =  -1/8  (16/100)+|(4/10)  (1+.16/16)1'5  =  0.181 

h   =  .425  =  a' 

a   =  2(.425)/1.425  =  .850/1.425  =  0.596. 

These  values  of  a  and  a'  are  those  used  in  the  computer  com- 
parison of  the  Theil-Wage  model.   These  parameters  will  insure 


+  U  o  <-     fKp    mnr)f>1      '■  c     nrtimal      fni      the-     Rtf> 


cordance  with  Equation  (D-l)  where  the  range  on  e      is  10  and 
the  range  on  6   is  4.   Using  the  results  of  Equation  (D-35) 
it  is  noted  that  the  min.  M.S.E.  should  be  approximately  20.(5. 

E.    BROWN'S  ONE -PARAMETER  EWMA  LINEAR  GROWTH  MODEL 
1 .   The  Series  Generation  Model 

The  series  generation  model  for  Brown's  forecast 
rule  is  simply  the  forecast  rule  with  random  numbers  applied 
as  the  shocks  to  process  level  and  slope.   Comparison  with 
Equation  (E-5)  will  reveal  that  the  forms  are  equivalent. 
The  model  may  be  written 

Yt  ■  \  +  et 

Tt  -  Vi  +  bt  +  et  (E"15 

bt  -  bt-i  +  £t 
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where  Y   is  the  observation  at  time  t,  Y   is  the  process 

level  at  time  t,  tK  is  the  process  slope  at  time  t  and  e^ 

t        r  t 

is  the  random  shock  observed  as  a  forecast  error  at  time 
t.   e'  is  a  random  shock  experienced  by  the  process  level, 
and  Brown's  forecast  rule  writes  this  shock  in  terms  of 
the  forecast  error  e'  =  (1-B2)e  .   eV  is  the  random  change 
in  process  slope  which  Brown's  model  assumes  may  be  written 
£*'  =  a2ef.   This  model  is  essentially  the  linear  growth 
model  in  Equation  (D-l)  with  Brown's  assumptions  concerning 
the  relationship  of  the  various  random  elements. 
2 .   The  Forecast  Model 

Brown's  forecast  model  as  discussed  in  Reference  7, 
is  essentially  a  special  case  of  the  Holt-Winters  model 
Equation  f l) -  '•  I  and  E  c  u  a  t  i  on  f D -  4 1  ~-  n  wh i ch  h e  restricted  t h ^ 
second  parameter  to  be  a  function  of  the  first.   The  fore- 
cast rule  is  identical  to  Equation  (D-2) 

Ft  =  Yt  ♦  k  bt  •  (E-2) 

To  illustrate  the  similarity  of  Brown's  model  to  the  Holt- 
Winters  model,  the  latter  model  may  be  written  in  the  ran- 
dom shock  form  using  the  forecast  error  as  the  shock 

et  ■  Yt  -  (Vi  +  \-J- 

Equation  (D-3)  becomes 

and  Equation  (D-4)  becomes 

bt  =  b^   *   „B«et  CE-4) 
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When  Brown's  double  smoothing  model  is  written  in  the  same 
form  [Ref.  7],  the  similarities  are  easily  seen  as 

bt  -  bt.j  ♦  (l-g)2  et. 

The  Holt-Winters  parameter  a,  which  smooths  the  process 
level  in  Equation  (E-3)  is  equivalent  to  Brown's  parameter 
(1-32)  or  a(2-a).   The  Holt-Winters'  parameter  product  a' 
in  Equation  (E-4)  is  equivalent  to  Brown's  a/2-a.   When 
these  equivalent  parameters  are  numerically  equal,  then 
Brown's  model  will  yield  the  same  result  as  the  Holt-Winters 
model.   The  forecast  model  used  in  the  computer  program  for 
this  comparison,  however,  is  the  more  familar  from  of  Brown's 

Yt  =  2Yt  -  y|"j  (E-6) 

bt  =  |  [Yt  -  Y[2)]  (E-7) 

where 

Yt  -  aYt  +  3Yt-1  (E-8) 

Y^2)  =  aYt  +  &[2_\.  (E-9) 

Equation  (E-6)  expresses  the  estimate  of  the  lag-corrected 
current  process  level,  and  Equation  (E-7)  expresses  the  cur- 
rent estimate  of  the  process  slope.   Equations  (E-8)  and 
(E-9)  are  the  single  and  double  smoothed  expressions  used  to 
calculate  the  level  and  slope.   The  method  involved  in  these 
calculations  makes  use  of  the  known  fact  that  single  smoothing 
(the  simple  exponentially  weighted  ir.oving  average  of  Equation 
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(E-8)  )  lags  a  trend  by  a  constant  amount.   The  double  smoothed 
version  in  Equation  (E-9)  lags  the  single  smoothed  value  in 
the  same  amount  by  which  single  smoothing  lags  the  observed 
trend.   If  Equation  (E-6)  is  rewritten  as  Yt  =  Y   +  (Y^Y^2)) 
the  term  in  parenthesis  is  recognized  to  be  the  lag  correc- 
tion required  to  give  the  current  adjusted  estimate  of  pro- 
cess level.   The  amount  of  lag  inherent  in  Equation  (E-8) 
for  trend  series  is  the  observed  result  of  using  past  data 
to  estimate  the  current  level.   This  lags  corresponds  to 
the  "average  age"  of  the  past  data  used  in  making  the  es- 
timate, and  the  error  due  to  this  lag  can  be  shown  to  be 

et  -  B/a  bt 

ilicil/iu  t  ^       Liie       C3v--i-iucii-v^       \_>  -i-       .)  J  u  y  o       _„.--      w>^       *Vij->->,v^ji       u.i       uuum,j.uii 

(E-7),  where  e   =  Y  -YJ;  \      Brown  extends  this  approach 
further  to  form  what  he  terms  third  order  smoothing,  and 
higher  levels,  but  these  forms  are  not  relevant  to  the  pre- 
sent comparison. 

3.   Parameter  Estimation 

Brown  advocates  an  empirical  approach  in  "fitting" 
the  forecast  to  the  series,  but  he  also  says  that  01=0.1  is 
a  good  multi-purpose  value.   Since  the  Holt-Winters  parame- 
ter values  were  taken  from  the  literature  as  their  best  es- 
timate of  a  good  value  combination,  Brown's  value  will  be 
taken  as  such  also.   While  this  value  may  be  improved  upon 
for  specific,  series  by  experimentation,  this  superficial  se- 
lection is  in  keeping  with  the  objective  of  this  study  -  to 
forecast  vrried  types  of  series  at  minimum  cost. 
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F.    THE  BOX- JENKINS  POLYNOMIAL  PREDICTOR  -  THE  GENERAL 
MODEL  OF  ORDER  N 

1 .   The  General  Polynomial  Growth  Model 

The  generating  model  for  which  the  Box-Jenkins 

Polynomial  predictor  is  the  optimal  linear  least  square 

predictor  is  discussed  by  Harrison  in  Reference  7.  The 

general  polynomial  model  specifies  that  each  "derivative" 

of  the  underlying  process  experiences  a  random  change  in 

such  a  way  that  for  the  N  order  polynomial  the  model  is 

given  by: 

Yt  -  Y™    *  et  (F-l) 


Y 


(i)  =  y(i)  +  Y(i+1)  +  Y(i)  i=l     n    fF-21 


Uo^-:.     v  v.  _     o     p  ?-j  rl     t-  Vi  /=>     rSHdOTl     erXOrS      £         qnrl     v^     ^      lisv,"     moan 

*  t  t       t 

zero  and  variance  a2I.   Comparison  of  Equation  (F-l)  and 

Equation  (F-2)  with  the  steady  model,  Equation  (B-4),  re- 
veals that  the  latter  equation  results  from  i=l  in  Equations 
(F-l)  and  (F-2).   A  similar  comparison  of  the  linear  growth 
model  Equation  (D-l),  reflects  that  this  form  results  from 
setting  i=l,2  in  Equation  (F-2).   The  particular  model  in 
Equation  (D-l),  has  y£2^=  b   and  y^    =  0  but  y*    could 
easily  have  had  a  positive  value  and  nothing  in  that  analysis 
would  have  changed  except  the  complexity  of  terms  in  the  de- 
velopment . 

2 .   Optimal  Properties  of  the  Polynomial  Predictor 
For  the  assumed  generating  model  the  Box-Jenkins 
polynomial  predictor  can  be  shown  to  be  optimal  in  the  lin- 
ear least  squares  sense  and  the  optimal  values  of  the 
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forecasting  parameters  a.  can  be  derived  as  functions  cf  the 
variances  of  the  random  shocks  e.  and  yi   ■   A  special  case 
of  the  Box-Jenkins  result  derived  on  pp.  312,  313  of  Ref- 
erence 14  will  be  employed  and  the  procedure  followed  by 
Harrison  on  p.  824  of  Reference  7  will  be  used  to  show  the 
result. 

Let  a  stochastic  process  by  generated  by  the  model 

Vi  ■  X0  "j  Vj  +  £t+i  ^ 

where  the  i^f}   have  zero  mean  and  are  identically  distributed 

uncorrelated  random  variables,  and  the  r\ .  are  constants.   Con- 

J 

sider  a  forecast  rule  of  the  form 


y\ 


Y   .  =  V  y.  Y   .  fF-41 

t+1    .«-*..   :   l  -  i 
j-u       - 

where  the  u .  are  constants.   Since  it  is  true  that 
3 

(F-5) 

E[et  +  1]2  =  B[Yt+r?t+1]»  =  B[  E  Cnj-yj)Yt   ]^E[et+1]2 

the  forecast  rule  will  be  optimal  in  the  least  squares  sense 
when  u.  =  n • •   When  this  is  true,  Equation  (F-5)  implies  that 
the  forecast  errors  {e.}  are  identical  to  the  random  shocks 
{c  },  and  therefore  are  also  uncorrelated.   By  successive 
substitution,  the  Equation  (F-4)  form  of  the  forecast  rule 
may  be  transformed  in  terms  of  the  forecast  errors  as 

where  the  W.  are  constants.   Writing  Equation  (F-6)  in  dif- 
ference equation  form 
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Wt  +  1  =  I  W  e  (F-7) 

it  follows  from  the  previous  analysis  that  this  forecast  rule 
is  optimal  for  the  equivalent  underlying  stochastic  process 

oo 

VIt  +  l    A*   j  't-j     bt  +  l 

since  the  forecast  errors  were  equivalent  to  the  random 
shocks.   Specifically,  the  forecast  rule  is  optimal  for  a 
series  generated  by  the  model 

VYt+1  =  Y    Y   SJ  e   +  Ve  (F-8) 

j  =  0   J 

where  S-'  is  used  to  denote  the  j tn  multiple  sum.   Differenc- 
ing Equation  (F-8)  n-1  times,  Harrison  arrives  at  the  expres- 

s  x  o  n 

n-1 

V  Y+Al  =  T     y.    V   .  ,  e„  (F-9) 

n  t+1    .«0  ' j   n- j -1   t  v 

which  has  no  error  terms  beyond  e.   , -.  .   The  result  which 

'      t-n+1 

has  been  found  states  that  if  a  random  observation  y  has 
the  property  that  its  ntn  difference  can  be  represented  as 
a  moving  average  process  of  iid  variables  {e.}   which  have 
zero  means  and  the  process  is  of  order  n+1,  then  the  Box- 
Jenkins  n  order  polynomial  predictor  given  by 

n-1 

i  =  0 
is  optimal  and  the  {ef}  represent  the  one -step-ahead  fore 
cast  errors  for  this  optimal  rule. 


?t+i  ■  K  +  X  "i  sl  et  (p-10) 
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3.  Parameter  Estimation 

The  procedure  used  by  Harrison  [Ref.  7]  shown  in 
section  B-5  for  the  simple  EWMA  model  demonstrates  the 
Box-Jenkins  parameter  estimation  procedure  for  the  steady 
model.  The  optimal  predictor  for  the  linear  growth  model 
is  the  form  of  the  general  predictor  recommended  by  Holt- 
Winters.  Harrison  has  derived  [Ref.  7]  the  optimal  parame- 
ters for  this  model  as  previously  described  in  Section  D.3.a. 

4 .  The  Relationship  of  Special  Cases  of  the  General 
Polynomial  Predictor 

The  forecast  models  proposed  by  Holt-Winters,  Theil- 
Wage ,  and  Brown  are  specific  forms  of  the  Box- Jenkins  Poly- 
nomial Predictor.   The  demonstrate  the  relationship,  Equation 
(F-10)  can  be  written  in  the  form 

?t+1  -    r  ^1}  c*-11) 

1=1 

where 

?ti}  ■    Xj  ftJ-l  +  aiet  (p-12) 

and 

e   =  Y   -  Y 
et     t     t 

This  form  resulted  from  expressing  the  Y   in  terms  of  the 

errors  and  substituting  in  Equation  (F-ll) . 

a.   The  Simple  Exponentially  Weighted  Moving  Average 

The  first  order  predictor,  where  n=l  in  Equation 

(F-ll)  causes  that  expression  to  reduce  to  the  simple  EWMA 

model 

Y     =  Y 

*t+l    !t 
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Y   -  Y     +  ae 
rt     t-1      t 

where  equivalence  to  Equation  (B-3)  is  seen  by  substitution 
for  the  error  in  terms  of  the  observation  and  forecast, 

K  -  Vi  ■  aYt  +  d-'J  V 

b.   The  Holt-Winters  Linear  Growth  Model 

When  n=2  in  Equation  (F-ll),  the  second  order 
predictor  becomes 

Yt+1  "  Yt  +  bt 

*t =  Vi  +  bt-i  +a  et 


where  v(^) 
t 


bt  =  bt_1  +  a'et 


c-  -,.t  y  t  ■f-  f-  p.  v>   a  c-   Vi 


c.  The  Brown  Linear  Growth  Model 

Brown's  second  order  predictor  is  a  particular 
form  of  Holt's,  where  the  two  parameters  a,  a'  are  restricted 
to  be  functions  of  the  discounting  parameter  B, 

a  =  1-32 

a'  =  (1-3)2. 

d.  The  Theil-Wage  Linear  Growth  Model 

The  Theil-Wage  Predictor  restricts  the  Holt- 
Winters  Parameters  so  that 

a  =  (i-h)  '    a   =  ha. 
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III.   RELATIVE  FORECAST  MODEL  PERFORMANCE  COMPARISON 

A.    COMPARISON  METHODOLOGY 

1 .   Time  Series  Generating  Model  Specification 

When  forecasts  of  a  particular  time  series  are  at- 
tempted, the  forecaster's  belief  concerning  the  underlying 
generating  process  governs  his  selection  of  a  specific  fore- 
cast model.   Although  recognizing  that  few  models  completely 
describe  the  complexity  of  practical  economic  or  physical 
processes,  by  his  selection  of  a  forecast  model  the  fore- 
caster thereby  indicates  that  the  time  series  has  an  under- 
lying stochastic  process  whose  functional  form  is  suggested 
by  the  forecast  rule.   The  adaptive  parameters  chosen  for 


-il"  X      ^^j^^-^j_j-^       Lil€      -a  ^>  *j  col*  \^  v*.     HO  1  Jit     OI      Cxi  6      t  viiv  i  u 


ting  process.   The  degree  to  which  the  actual  underlying 
process  differs  from  the  assumed  form  is  reflected  in  the 
forecast  accuracy  obtained  by  the  forecast  model. 

2 .   Forecast  Model  Specification  Error  Measurement 
Any  comparison  of  relative  forecast  accuracy  is 
fundamentally  an  attempt  to  determine  which  of  the  forecast 
models  is  the  more  accurate  specification  of  the  process 
generating  the  series  being  forecast.   The  measure  of  speci- 
fication error  used  in  this  comparison  will  employ  the  method 
discussed  by  Bossons  [Ref .  13] ,  who  defined  specification 
error  as  the  additional  variance  of  forecast  errors  introduced 
by  misspecif icat ion  of  the  generating  process.   This  measure 
cannot  be  determined  when  comparison  is  attempted  using  actual 
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time  series  data,  since  the  precise  underlying  model  is  un- 
known.  Since  the  various  series  examined  in  this  compari- 
son have  been  generated  from  known  process  models  for  which 
the  optimal  forecast  model  and  parameters  are  also  known, 
it  is  possible  to  determine 

VAR(Yt-Y£*) 
f  =  VAR(Yt-Y£)     X 

as  a  measure  of  the  model  specification.   If  {Y  }  is  the 
series  being  forecast,  then  {Y**}  is  the  series  of  forecasts 
generated  by  the  forecast  model  whose  associated  specifica- 
tion error  is  being  measured,  and  {Y*}  is  a  series  of  optimal 
(minimum  asymptotic  variance)  forecasts  for  the  series  {Y  } . 
ThQ  lo^'e7*  bound  on  f  is  zero,  and  is  obtained  when  {Y**}  = 
{Y*}.   No  upper  bound  on  f  exists.   Positive  f  values  reflect 
the  extent  to  which  a  particular  forecast  series  has  been 
degraded  by  misspecif ication  of  the  underlying  stochastic 
process.   This  measure,  as  Bossons  [Ref.  13]  has  observed, 
has  two  uses.   First,  it  permits  the  effect  of  a  known  mis- 
specification,  such  as  a  model  simplification,  to  be  measured 
This  emphasizes  the  relative  importance  of  various  coeffi- 
cients or  parameters  in  the  model,  and  its  sensitiveness  to 
change.   Second,  it  permits  the  robustness  of  a  forecast 
model  to  be  measured  by  reflecting  the  forecast  accuracy 
for  various  types  of  series  misspecif ications ,  such  as  cor- 
relation, linearity,  or  for  specific  distributions  of  the 
random  variables  in  the  process. 
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3 .   Forecast  Model  Performance  Criterion 

The  measure  of  model  specification  error  permits 
a  preference  ordering  or  ranking  of  forecast  models  to  be 
made  for  any  given  time  series  generation  model.   It  is 
often  true  that  a  collection  of  these  series  must  be  fore- 
cast, and  few  of  the  series  are  represented  by  any  one 
generating  model.   It  would,  of  course,  be  possible  to  anal- 
yze all  the  series  and  group  them  according  to  these  process 
relationships,  and  for  some  processes  this  expense  is  justi- 
fied.  For  many  series  however,  as  mentioned  in  the  introduc- 
tion to  this  study,  it  is  not  critical  that  the  forecasts  be 
exceedingly  accurate,  and  the  forecast  costs  must  be  kept  to 
a  minimum,  consistent  with  the  benefits  obtained  from  such 
forecasts.   An  additiona]  means  uj  measurement,  therefore, 
is  needed  to  permit  a  preferred  ranking  of  forecast  models 
over  a  collection  of  series  to  be  forecast.   It  may  be  true 
that  one  or  two  forecast  models  may  provide  the  necessary 
accuracy  over  the  entire  collection  of  series.   However,  it 
is  recognized  that  this  is  extremely  situation  dependent. 
The  method  used  in  this  study  to  suggest  possible  "best" 
forecast  models  for  use  in  predicting  a  collection  of  dis- 
similar time  series  is  the  calculation  of  the  average  speci- 
fication error  over  all  series  forecast,  and  the  sample 
variance  of  the  specification  error.   Selection  of  a  model 
based  on  the  estimated  mean  specification  error  would,  of 
course,  imply  that  a  few  forecasts  which  were  grossly  erro- 
neous could  be  tolerated,  while  selection  based  upon  esti- 
mated specification  error  variance  would  suggest  that  a 
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uniformly  high  level  of  accuracy  was  not  required  but  that 
no  extremely  poor  forecasts  were  acceptable.   Some  combina- 
tion of  the  two  criteria  may  even  be  considered.   The  con- 
clusions of  this  study  will  be  restricted  to  rankings  in 
terms  of  the  mean  and  variance  of  specification  error.   No 
valid  general  interpretation  can  be  given  to  these  conclu- 
sions, since  they  are  applicable  only  for  the  class  of 
models  and  distributions  chosen  for  use  in  this  study. 

The  method  would,  however,  be  applicable  to  the 
relative  comparison  of  a  group  of  forecast  models  when 
forecasting  actual  time  series,  if  one  were  willing  to 
assume  that  the  particular  model  with  the  least  mean  square 
forecast  error  was  "optimal"  for  a  particular  series.   Even 
with  bueujin.ai.iuii  error  isascci  only  upon  l..c  best  model, 
rather  than  upon  a  truly  optimal  (minimum  variance  error) 
model,  it  would  still  be  possible  to  draw  meaningful  con- 
clusions concerning  the  relative  effectiveness  of  a  group 
of  models.   The  same  approach  would  apply  when  searching 
for  a  general  purpose  parameter  for  use  in  a  single  model 
which  must  be  applied  to  a  collection  of  series.   It  is  un- 
likely that  an  optimal  parameter  for  one  series  will  be  op- 
timal for  all.   Use  of  this  method  on  a  representative  sample 
of  series  would  facilitate  selection  of  the  parameter  which 
minimizes  the  average  forecast  error  variance  or  some  other 
selected  criterion  for  the  entire  collection  of  series,  and 
not  just  for  one  particular  series. 
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4 .   Time  Series  Data  Generation  Methods 
a.   Parameter  Selection 

For  each  of  the  forecast  models  discussed  in 
Section  II  a  time  series  was  generated  using  the  underlying 
process  generation  model  associated  with  that  specific  fore- 
cast rule.   The  random  shock  forms  of  those  rules  only  re- 
quire that  the  parameters,  such  as  intercept  and  slope,  be 
specified  and  that  the  random  shock  be  added  to  provide  the 
stochastic,  element  for  the  series.   It  was  determined  during 
the  course  of  the  study  that  the  variance  of  the  random  ele- 
ment caused  little  or  no  change  in  the  specification  error, 
so  no  attempt  was  made  to  produce  results  over  a  range  of 
model  parameters.   The  reason  for  this  was  that,  even  though 
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changed  in  generally  the  same  proportion  for  all  and  the  ratios 
remained  approximately   constant.   In  no  case  would  the  rank- 
ing have  changed  due  to  the  parameters  selected.   Parameters 
used  to  obtain  the  representative  results  contained  in  Tables 
I  and  II  may  be  determined  by  consulting  the  computer  program, 
b.   The  Random  Number  Generator 

The  random  number  generator  upon  which  all  sto- 
chastic series  properties  are  based  is  a  function  called 
URN  which  is  contained  in  the  Naval  Postgraduate  School  IBM 
360-67  computer.   This  additive  generator  was  selected  as  a 
standard  instead  of  available  multiplicative  generators  or 
the  conventional  RANDU  since  it  was  almost  three  times  faster 
than  RANDU  and  had  been  subjected  tc  statistical  tests  which 


were  on  file  at  the  computer  facility  at  USNPGS.   As  a 
known  quantity,  questions  involving  this  generator  and  its 
effect  on  results  should  be  quickly  resolved  without  ad- 
ditional statistical  testing  of  the  generator. 

c.  Normal  Random  Number  Generation 

The  normal  random  numbers  used  with  the  Least- 
Squares  Models  were  generated  from  the  uniform  (0,1)  URN 
output  by  using  the  Central  Limit  approach  outlined  by 
Naylor  [Ref.  8]  on  pages  92-93.   This  amounted  to  summing 
twelve  uniform  (0,1)  numbers  and  subtracting  six  to  produce 
a  normal  (0,1)  number. 

d.  Uniform  Random  Number  Range  Transformation 
The  uniform  random  numbers  were  transformed 

to  various  ranges  Cpage  79,  Ref.  H)    using  the  expression 

Y   =  A  +  (B-A)e   0  <  £  <  1 

which  is  a  rearranged  form  of  the  uniform  cumulative  distri- 
bution function.   Since  A  =  -B  in  this  study,  this  expres- 
sion can  be  written  as 

Yt  =  (2et-l)B  =  (et-0.5)2B 

which  is  the  form  used  in  the  Fortran  program  to  transform 

the  random  numbers  to  desired  ranges. 

5 .   Single  and  Group  Series  Forecast  Performance  Com- 
parison 

a.   Single  Series  Comparison 

After  a  time  series  was  produced  from  one  of  the 

generating  models  in  the  computer  program,  all  forecast  models 
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were  used  to  forecast  each  observed  series  value,  given  the 
past  values.   After  each  forecast  the  forecast  error  was 
stored  and  after  all  models  had  completed  forecasting  the 
series,  the  average  forecast  error  and  the  sample  variance 
of  this  error  were  calculated.   From  this  the  sample  es- 
timate of  the  specification  error  (f)  discussed  in  Section 
2  above  was  computed  for  each  model.   Table  I  contains  data 
for  representative  comparison  made  using  this  method  of 
measurement . 

b.   Forecast  Comparisons  Using  Several  Series 
When  the  single  series  comparisons  had  been 
completed,  the  average  specification  error  and  estimated 

c  juh'O  2.  °     Variance     v  p  ■>"  e     *  V\  o  n     r^l  n:1  stpH     ■f-mr     r>r!f~h      fnrpra^f     IUO  d  C 1  « 

This  is  the  performance  criterion  discussed  in  Section  3. 
As  noted  there,  the  results  may  be  interpreted  only  in  re- 
lation to  the  specific  series  combination  examined.   Table 
III  contains  representative  data  for  comparison  of  series 
using  this  criterion. 

6 .   Forecast  Model  Stabilization  and  Operation 

A  stabilization  period  of  100  observations  was  used 
to  remove  any  effects  due  to  improper  starting  conditions 
and  then  the  forecast  error  was  compiled  over  the  next  300 
observations.   The  specific  initial  conditions  for  each  model 
which  were  used  to  obtain  the  representative  results  in 
Tables  I  and  II  may  be  determined  by  consulting  the  computer 
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program.   Sufficient  comments  have  been  provided  to  assist 
in  identification  of  program  components  and  the  variables 
used  in  each.   Although  the  results  represent  only  sample 
estimates  of  average  forecast  error  and  forecast  error 
variance,  the  number  of  observations  forecast  was  considered 
large  enough  to  provide  a  reasonable  estimate  of  the  actual 
values . 

B.    COMPUTER  COMPARISON  ANALYSIS 

1 .   Prior  Performance  Expectations 

The  forecast  models  compared  were  selected  with 
certain  a   priori    relative  outcomes  in  mind.   Due  to  the 
nature  of  the  assumptions  on  the  models  and  the  generated 
<z(*-}ttp<z    n <; <; nri  at ed  with  those  models  it  was  anticioated  that 

a.  each  model  would  be  "optimal"  (have  the  least 
estimated  mean  squared  error)  for  its  corresponding  time 
series . 

b.  the  modified  least-squares  forecast  model  would 
produce  smaller  forecast  errors  than  the  simple  least- 
squares  model  due  to  the  correlated  nature  of  most  series 
used  in  the  study. 

c.  the  forecasts  generated  by  the  simple  EWMA 
model  would  tend  to  lag  behind  the  linear  growth  processes 
by  a  relatively  constant  amount. 

d.  the  Cox-modified  EWMA  would  tend  to  track  the 
linear  models  better  than  the  unmodified  model. 

e.  the  Holt-Winters  model  would  probably  perform 
better  overall  than  any  of  the  models  above  since  it  is 
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suitable  for  both  the  autocorrelated  data  and  for  the 
linear  model  used  in  five  of  the  generating  processes. 

f.  the  Theil-Wage  model  would  have  the  same 
general  characteristics  as  the  Holt-Winters  model,  but 
probably  provide  better  response  to  process  changes  due 

to  their  use  of  a  more  "simultaneous"  approach  (use  of  the 
most  recent  slope  rather  than  the  use  of  the  slope  calcu- 
lated last  period)  than  that  used  by  Holt  and  Winters. 

g.  the  Brown  model  would  have  generally  the  same 
characteristics  as  the  Holt-Winters  model,  but  not  perform 
quite  as  well  due  to  the  restrictions  which  Brown  placed 

on  forecast  parameters.   It  is  argued  by  some  [Ref.  7]  that 
Brown's  more  parsimonious  model  can  achieve  essentially  the 
same  results  in  practice  as  that  prov:,cc^  by  either  1-h^  Holt- 
Winters  model  or  the  Theil-Wage  model.   Quantitative  evi- 
dence to  support  or  refute  this  claim  was  one  expected  result 
of  this  investigation. 

2 .   Forecast  Model  Performance  Comparisons 
a.   The  Least- Squares  Series 

The  least-squares  forecast  model  performed  bet- 
ter than  all  others  in  predicting  this  series,  as  anticipated 
Comparison  of  the  process  variance  of  9.13  (see  Table  II) 
with  the  forecast  error  variance  of  8.98  tends  to  confirm 
that  the  least- squares  forecast  model  produced  optimal  pre- 
dictions.  The  next  best  forecast  model  for  this  series  was 
Brown's  double  smoothing  model  with  an  11.41  larger  fore- 
cast error  variance. 
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It  may  be  observed  (Table  I)  that  the  modified 
least- squares  model  was  ranked  fourth  after  the  Holt-Winters 
model.   The  unnecessary  correction  for  correlation  in  the 
random  shocks  degraded  the  quality  of  forecasts  generated 
by  this  model.   Note  also  that  the  simple  EWMA  forecast 
model  lags  the  observations  by  3.036  (see  Table  I).   This 
was  expected  since  this  model  specifies  that  the  forecast 
for  the  next  period  (or  any  future  period)  is  the  current 
level  of  the  process,  and  slope  of  the  linear  model  was 
selected  to  be  3.0.   The  EWMA  model  was  thus  never  able  to 
anticipate  the  change  due  to  slope.   This  is  the  refinement 
incorporated  into  Brown's  double  smoothing  method  and  it 
appears  to  be  effective  in  that  model. 

D.   ihe  Modified  Leas  i  -Squat  t-^  3e^  ie> 

The  results  for  this  series  were  much  the  same 
as  for  the  simple  least- squares  series  except  that  the  least- 
squares  models  reversed  their  roles.   The  modified  least- 
squares  model  produced  approximately  the  same  forecast  error 
variance  (8.98)  from  the  correlated  series  as  the  least- 
squares  model  had  obtained  previously  on  the  uncorrelated 
series  (see  Table  II),  and  the  least-squares  model  produced 
almost  the  same  specification  error  for  this  series  as  the 
modified  model  had  obtained  when  forecasting  the  uncorrelated 
series.   The  change  from  uncorrelated  to  correlated  normal 
random  shocks  affected  the  other  models  in  varying  degrees 
(see  Table  I) .   The  least-squares  model  and  the  Brown  model 
appear  to  possess  equivalent  capabilities  to  forecast  the 
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correlated  series,  while  the  modified  EWMA  model  continued 
to  be  the  least  desirable  for  use  in  forecasting  the  series. 

c.  The  Simple  EWMA  Series 

The  simple  EWMA  model,  although  its  forecast 
error  variance  was  not  as  low  as  the  series  variance  (see 
Table  II),  gave  the  least  forecast  error  variance  for  this 
series.   Almost  all  models  seemed  to  be  capable  of  forecast- 
ing this  series  adequately,  but  the  Theil-Wage  model  was  the 
least  desirable  by  far.   The  non-linearity  of  this  series 
became  noticeable  in  the  relatively  poor  performance  of  the 
least-squares  model,  but  the  effect  was  not  pronounced  due 
to  the  limited  range  of  the  random  walk  in  the  series  gener- 
ating model.   The  modified  least- squares  model  was  more 
cHiMuit  o i  iorccsstms  lui a  scries  cut©  lO  jl.^>  a.u x x ±  l. _v  <_  c 
use  the  added  information  contained  in  the  series  correla- 
tion. 

d.  The  Modified  EWMA  Series 

The  modified  EWMA  forecast  model  developed  by 
Cox  generally  provided  the  least  forecast  error  variance, 
but  it  may  be  observed  (Table  I)  that  the  modified  least- 
squares  model  produced  almost  the  same  results  for  this 
series.   The  modified  EWMA  model  does  not  share  this  versa- 
tility when  forecasting  the  modified  least-squares  series. 
The  next  ranked  model  in  terms  of  least  forecast  error  was 
the  simple  EWMA  model,  but  its  error  variance  was  18.4% 
greater  than  the  modified  EWMA  model.   The  modified  least- 
squares  model  provided  a  0.006  specification  error  during 
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this  particular  comparison,  but  on  others  it  reflected  a 
slight  negative  error  implying  that  it  performed  better  than 
the  "optimal"  model.   Based  on  the  simulation  results  there 
appears  to  be  no  real  difference  in  forecast  accuracy  re- 
gardless of  which  of  these  two  models  were  used.   Reference 
to  Table  II  reflects  that  these  two  models  were  able  to  re- 
duce forecast  error  variance  substantially  below  that  ob- 
served in  the  series.   This  suggests  a  true  predictive  cap- 
ability not  possessed  by  the  other  models  whose  forecast 
error  variances  were  each  approximately  equal  to  or  greater 
than  the  series  variance. 

e.   The  Holt-Winters  Linear  Growth  Series 

The  Holt-Winters  forecast  model  achieved  the 

mance  was  only  slightly  better  than  that  shown  by  the  Brown 
model  (see  Table  I).   A  few  comparison  trials  have  resulted 
in  the  Holt-Winters  model  obtaining  a  smaller  forecast  error 
variance  for  the  Brown  growth  series,  and  the  Brown  forecast 
model  demonstrates  a  similar  capability  to  perform  better 
than  the  Holt-Winters  model  on  the  Holt-Winters  series. 
These  outcomes  were  regarded  as  sample  variations,  but  the 
implication  is  obvious.   The  results  of  these  models  are  so 
comparable  that  it  would  be  difficult  to  conclude  that  any 
real  difference  existed  between  them  (for  this  series) .   The 
next  best  performance  (the  Theil-Wage  model)  resulted  in  a 
72.61  increase  in  forecast  error  variance. 
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f.   The  Theil-Wage  Series 

The  Theil-Wage  forecast  model  proved  to  be 
distinctly  optimal  for  forecasting  its  assumed  underlying 
process.   Previous  comparisons  have  shown  that  the  various 
observed  series  could  have  been  generated  by  several  slightly 
different  models,  as  evidenced  by  the  comparable  forecast 
performance  of  the  several  forecast  models.   Here,  however, 
the  next  best  forecasts  were  obtained  by  the  simple  EWMA 
model  with  almost  four  times  the  forecast  error  variance. 
The  minimum  error  variance  of  18.14  (see  Table  II)  exhibited 
by  the  Theil-Wage  forecast  model  is  much  larger  than  the 
series  variance  of  7.99,  but  it  compares  favorably  with  the 
predicted  theoretical  mean  square  error  of  19.7  obtained  by 
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g.   The  Brown  Linear  Growth  Series 

The  results  obtained  while  forecasting  this 
series  tend  to  further  reinforce  the  observations  made  con- 
cerning the  Holt-Winters  results.   The  Brown  double  smooth 
ing  model  obtained  the  least  forecast  error  variance,  but 
the  Holt-Winters  model  only  slightly  exceeded  that  minimum 
value.   Other  models  tended  to  forecast  this  series  some- 
what more  accurately  than  the  Holt-Winters  Series,  but  the 
same  performance  ranking  resulted  (see  Table  I) . 
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IV.   CONCLUSION  AND  RECOMMENDATIONS 

A.    CONCLUSIONS 

1 .   Validity  of  Assumptions 

Throughout  this  thesis  it  has  been  stressed  that 
the  final  conclusions  would  necessarily  be  general  in  na- 
ture.  The  assumptions  which  led  to  these  conclusions  must 
not  be  overlooked,  for  some  of  them  may  place  significant 
restrictions  on  the  applicability  of  the  methods  studied. 
If  these  assumptions  are  changed,  introducing  other  dis- 
tributions or  parameters,  the  methodology  introduced  here 
is  equally  valid  in  those  circumstances. 

Some  specific  assumption  which  are  thought  to 
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a.  The  composition  of  the  selected  group  of  fore- 
cast models.   Some  models  not  considered  here  may  have  proved 
superior  to  all  of  those  selected.   The  conclusions  there- 
fore are  applicable  only  to  the  specific  set  of  forecast 
models  treated. 

b.  The  specific  parameters  selected  for  each 
forecast  model.   Winters  and  Brown  have  recommended  certain 
values  for  use  with  their  models  for  general  purpose  ap- 
plication.  No  such  recommendation  was  found  for  the  Theil- 
Wage  model,  for  example.   A  specific  series  was  generated, 
and  the  optimal  parameters  for  that  series  were  used  for 
all  series.   Perhaps  a  more  general  set  of  parameters  for 
the  Theil-Wage  model  exists  which  would  have  made  its  per- 
formance s  iperior  to  the  ether  models. 

74 


c.  The  use  of  the  uniform  distribution  and  the 
range  selected  for  each  may  have  had  significant  impact 

on  forecast  model  performance.   These  distributions  do  not 
necessarily  represent  any  physical  or  economic  process. 

d.  Finally,  it  must  be  remembered  that  these 
conclusions  result  from  simulated  time  series  data  where 
the  generating  parameters  were  accurately  known,  and  no 
sweeping  claims  can  be  made  for  the  results  of  any  simula- 
tion. 

While  it  is  perhaps  proper  to  be  skeptical  of  the  specific 
conclusions  to  be  made  here,  it  is  again  suggested  that 
much  better  assumptions  can  be  made  in  the  context  of  a 
problem,  and  when  this  comparison  is  repeated,  the  conclu- 
5 ions  ai  that  time  should  be  Tim  significant  and  of  Drac- 
tical  value  to  the  forecaster. 

2 .   Conclusions  and  Applications 

Based  upon  the  representative  results  in  Tables  I 
and  II,  and  the  analysis  in  Section  III,  B,  the  following 
conclusions  are  made  concerning  forecast  model  performance 
for  the  tested  series: 

a.   The  use  of  the  least-squares  model  and  the 
modified  least- squares  model  for  forecasting  the  Holt-Winters, 
Theil-Wage,  and  Brown  series  gave  poor  results,  primarily  be- 
cause these  series  tend  to  be  non-linear  for  many  of  the 
chosen  ranges  of  the  distribution  of  the  random  variables. 
In  attempting  a  "non-discounted"  linear  fit  to  data  which 
appear  quadratic  over  large  periods,  enormous  errors  result. 
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The  lesson  here,  if  one  may  be  found,  is  that  a  linear  fore- 
cast model  such  as  least  squares  must  not  be  applied  to  data 
which  have  a  tendency  to  be  non-linear.   When  such  a  case  is 
suspected,  the  "discounted"  linear  models  are  to  be  prefer- 
red, since  they  tend  to  fit  the  data  only  locally,  with  the 
more  distant  observations  given  relatively  little  weight. 

(b)   The  performance  of  the  Holt-Winters,  Brown, 
and  modified  least-squares  linear  models  on  stationary  (no 
trend)  data  is  good,  so  unless  no  question  exists  as  to  the 
constant  level  of  the  random  process  it  would  appear  desir- 
able to  use  a  linear  model.   In  the  event  that  a  trend  oc- 
curs, the  model  will  follow  it  well,  and  if  not,  it  will 
still  give  good  forecasts.   If  a  series  similar  to  the 

IIiOCtj.rj.CC    L.  i'lilA     j-3     j.ii\Ci.y      tO     uC     Cn^L/uutt  1  cu  ,      ljic     muu  i  ntu 

least-square  is  to  be  preferred  over  the  Holt-Winters  or 
Brown  models.   This  model  is  distinctly  superior  to  all 
others  when  only  the  first  four  series  in  Table  I  are  con- 
sidered.  One  reason  for  its  superiority  over  the  standard 
least-squares  model  is  that  the  latter  can  generally  only 
be  expected  to  have,  at  best,  a  forecast  error  variance 
equal  to  the  variance  of  the  process  about  its  mean.   This 
is  due  to  the  model's  attempt  to  forecast  the  mean  value 
of  the  series  (the  least- squares  regression  line).   The 
modified  form  further  extends  the  simple  least-squares 
model  and  adds  a  correction  factor  based  on  the  known  cor- 
relation and  the  last  forecast  error.   With  this  added  in- 
formation its  forecast  capability  is  much  improved.   That 
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this  is  only  true  when  the  random  shocks  are  correlated,  how- 
ever, may  be  seen  from  Table  I. 

c.  The  Modified  EWMA  Model  was  designed  to  permit 
Wiener's  linear  predictor  to  adapt  to  changes  in  process 
level.   It  did  not  appear  to  achieve  this  goal  very  effec- 
tively, since  the  simple  EWMA  model  performed  better  on 
every  series  except  the  one  for  which  the  modified  model  was 
intended  to  be  optimal.   The  data  tend  to  suggest  that  the 
Cox  modified  model  is  a  special  one  with  limited  adaptive 
capability  when  applied  as  a  result  of  misspecif ication  of 
the  underlying  model.   This,  of  course,  is  not  in  agreement 
with  the  prior  expectations  of  this  model  as  stated  in  Sec- 
tion IV  B.  1,  but  after  some  reflection  is  not  very  sur- 

change  in  the  mean  level  of  a  random  walk.   The  rapid  changes 
which  occur  in  the  linear  model  would  not  normally  be  expec- 
ted to  occur  in  a  random  walk  process,  or  the  walk  would  lose 
its  random  property.   Therefore  the  model  was  not  intended 
to  follow  a  linear  trend,  and  without  further  modification 
should  not  be  used  to  attempt  this. 

d.  The  results  in  Table  I  lead  to  a  conclusion 
that  the  Brown  and  Holt-Winters  models  are  comparable,  with 
Brown's  model  showing  slightly  better  overall  performance. 
This  tends  to  support  Harrison's  claim  [Ref.  7]  that  Brown's 
model  is  preferred  in  practice  due  to  its  simpler  construc- 
tion but  comparable  result. 

e.  The  measure  suggested  earlier  in  this  study 
of  an  average  specification  error,  and  specification  error 
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variance  in  selecting  the  best  models  tends  to  bias  selec- 
tion in  favor  of  the  Theil-Wage  forecast  model  (see  Table 
III).   The  reason  for  this  is  that,  although  it  did  not 
perform  as  well  on  all  other  series  as  for  instance  the 
Brown  model,  the  other  models  almost  completely  failed  to 
follow  the  Theil-Wage  series.   As  a  result,  the  variance 
and  average  of  specification  error  were  inflated  for  all 
other  models.   The  uniqueness  of  the  Theil-Wage  series,  as 
evidenced  by  the  relatively  poor  showing  of  other  forecast 
models  in  predicting  this  series,  is  an  excellent  example 
of  the  need  to  make  such  model  comparisons  as  this  study 
has  done.   If  one  were  restricted  to  using  the  other  fore- 
cast models  for  predicting  this  series,  it  might  be  thought 
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accurately  forecast  by  any  model.   When  such  a  series  oc- 
curs in  practice  and  none  of  the  standard  models  seem  to 
apply,  a  comparison  of  widely  assorted  forecast  models  may 
suggest  a  more  appropriate  form  for  use. 

An  application  of  the  comparison  methodology  pre- 
sented in  this  thesis  might  be  made  by  a  supply  item  manager, 
who  is  responsible  for  forecasting  demands  for  stocked  items 
and  insuring  that  stockouts  do  not  occur  more  frequently  than 
some  specified  rate.   The  manager  could  select  sample  demand 
data  which  was  representative  of  his  stockage  items,  or  if 
the  differences  in  demand  distributions  for  some  items  were 
too  great,  he  might  form  two  or  more  homogeneous  groupings 
from  which  representative  series  were  taken  and  select  a 
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model  for  each.   The  forecast  models  could  then  be  used  to 
forecast  the  selected  series  and  the  measures  of  specifica- 
tion error  calculated  using  the  least  forecast  error  vari- 
ance model  which  resulted.   Of  course,  the  forecast  technique 
in  use  at  the  time  of  the  comparison  should  also  be  included. 
If  the  method  is  a  good  one,  this  comparison  will  demonstrate 
clearly  how  good  it  is  in  terms  of  the  added  forecast  error 
involved  in  using  other  less  applicable  methods.   The  inter- 
pretation of  100  times  the  specification  error  as  the  added 
percentage  of  forecast  error  variance  caused  by  model  mis- 
specification  should  be  easily  understood  by  all  those  in- 
volved in  model  selection.   This  approach  is  recognized  to 
be  strictly  empirical,  but  for  mass  producing  forecasts  on 
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not  be  justified  in  many  cases. 

In  the  event  that  no  model  appears  clearly  superior  to 
others,  the  comparison  results  still  permit  an  intelligent 
model  selection  procedure.   Consider  the  results  in  Table  I. 
It  is  evident  that  when  the  linearity,  serial  independence 
and  normality  assumptions  are  satisfied  the  least  squares 
method  should  always  be  used.   Uncertainty  about  the  satis- 
faction of  these  assumptions  poses  an  interesting  decision 
problem.   If  the  least  squares  forecast  model  is  used  and 
the  generating  model  of  the  forecast  series  is  actually  as- 
sociated with  one  of  the  other  forecast  models,  the  forecast 
error  variance  "penalty"  for  use  of  the  least  squares  method 
ranges  from  an  additional  18.3%  minimum  to  over  650,000%. 
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On  the  other  hand,  if  one  were  to  select  the  Brown  model, 
the  "penalty"  would  range  from  11.4%  if  the  least  squares 
model  should  have  been  chosen,  up  to  a  maximum  of  over  2300% 
in  a  worst  case.   For  six  of  the  seven  possible  series, 
however,  the  Brown  model  would  result  in  less  than  a  25% 
penalty.   The  "max-min"  solution  suggested  by  Table  I  is 
the  selection  of  the  Theil-Wage  model,  which  never  results 
in  a  forecast  error  variance  penalty  greater  than  113%  or 
just  over  twice  the  error  variance,  no  matter  which  series 
is  forecast.   It  may  be  noted  that  this  model,  though,  al- 
ways causes  at  least  a  60%  penalty  unless  it  happens  to  be 
the  optimal  model.   If  the  forecaster  had  reasonably  good 
information  on  the  likelihood  of  occurrence  of  the  various 
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variance,  a  decision  rule  could  be  formulated  to  guide 
the  selection  of  a  preferred  forecast  model. 

An  additional  application  which  is  actually  only  a 
variation  of  the  approach  used  in  this  thesis,  is  the  modi- 
fication of  various  generating  models  to  introduce  varying 
degrees  of  autocorrelation  (or  lagged  variables  in  the  least 
squares  models)  to  determine  their  relative  effects  on  the 
forecast  models.   Previous  studies  have  shown  a  negative 
bias  in  variance  of  least  squares  parameter  estimates  caused 
by  autocorrelation.   A  negative  bias  in  the  estimates  them- 
selves occurs  when  3  >  0  in  the  lagged  variable  case.   These 
results  may  be  shown  analytically  (p  211-221,  [Ref.  12]). 
The  studies  have  gone  on  to  show,  through  simulation,  that 
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a  strong  positive  bias  occurred  when  autocorrelation  and 
lagged  variables  were  both  present.   Such  investigations  of 
model  interactions  could  make  use  of  the  methodology  de- 
scribed in  this  thesis  as  a  quantitative  measure  of  results, 
and  further  add  to  the  present  understanding  of  economic 
time  series  data. 

B.    RECOMMENDATIONS 

1 .   Extensions  of  this  Investigation 

Some  recommended  extensions  of  this  thesis  which 
may  result  in  improved  forecasting  models  or  a  better  fore- 
cast model  selection  criteria  are: 

a.  A  determination  of  the  effect  on  the  results 
of  this  studv  of  distributions  other  than  uniform. 

b.  Investigation,  along  the  lines  suggested  by 
Cox  [Ref.  5],  of  the  advantages  of  replacing  Y   in  Equation 
(C-3)  by  some  other  form  of  moving  average. 

It  would  be  interesting  to  determine  whether  re- 
quirements such  as  minimizing  the  effects  of  long  term  trends 
could  be  used  to  choose  between  these  alternative  forms  of 
moving  average.   Limited  tests  at  the  close  of  this  study, 
where  Brown's  double  smoothing  model  was  substituted  into 
Cox's  modified  model  instead  of  the  simple  EWMA  model  nor- 
mally used,  resulted  in  significant  improvements  in  fore- 
casts of  series  from  the  linear  generating  models.  (Excluding 
the  Theil-Wage  series,  the  average  specification  error  was 
reached  from  1.58  to  0.58.) 

c.  Investigation  of  "tracking  signals."   When  such 
misspecification  occur  such  as  usin?  the  Holt-Winters  model 
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to  forecast  the  Theil-Wage  series,  a  model  modification 
which  has  been  suggested  by  some  [Ref.  6,  16,  17,  18)  is 
the  use  of  a  tracking  signal  to  serve  as  the  "exception"  to 
management  that  the  forecast  model  is  not  able  to  perform 
satisfactorily.   The  forecast  error  tolerance  of  this  sig- 
nal may  be  set  at  any  level  needed.   Such  signals  may  be 
used  to  obtain  the  attention  of  management,  or  as  brought 
out  in  the  references  given,  to  adjust  the  smoothing  con- 
stants to  accommodate  the  series.   This  procedure  probably 
has  the  most  potential  for  handling  the  wide  assortment  of 
random  process  forecasts  which  is  often  required.   The  cal- 
culation of  "optimal  parameters"  is  obviously  not  a  solution- 
due  to  the  variety  of  series  encountered  in  practice.   Em- 
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as  Winters  and  Brown  have  done,  fails  when  gross  specifica- 
tion error  occurs,  but  a  combination  of  a  "good"  parameter 
and  a  tracking  signal  adjustment  would  appear  to  be  an  ef- 
ficient solution  to  the  problem.   As  an  extension  to  this 
thesis  it  would  be  interesting  to  determine  the  degree  of 
improvement  gained  by  the  addition  of  tracking  signals 
to  the  models.   Although  such  signals  have  generally  been 
discussed  in  the  context  of  EWMA  models,  the  concept  very 
likely  could  be  applied  to  least  squares  models  and  improve 
their  performance  in  those  cases  where  the  model  assumptions 
are  violated.   Successful  application  of  such  refinements 
would  contribute  substantially  to  a  more  "automatic"  fore- 
casting system  and  increase  the  population  of  time  series 
for  which  any  particular  model  may  be  successfully  applied. 
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2 .   Application  of  Methodology 

While  it  might  be  premature  to  suggest  that  any 
actions  be  taken  on  the  sample  results  obtained  in  this 
thesis,  it  is  maintained  that  the  procedure  used  is  sound, 
and  could  be  applied  immediately  to  a  practical  problem 
using  actual  data.   Slight  modifications  must  be  made  to 
the  Fortran  program  such  as  substituting  "read"  statements 
for  the  series  generators  and  a  logical  check  added  to 
determine  the  model  with  least  forecast  error  variance  for 
a  specific  series.   The  program  would  then  give  results 
which  could  be  readily  interpreted  by  forecast  personnel. 
It  is  strongly  recommended  that  NavSup  or  any  other  agencies 
using  forecast  models  give  consideration  to  use  of  this 
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model.   If  the  quantitative  measure  obtained  justifies  con- 
tinued use  of  the  same  model,  then  the  procedure  may  be 
repeated  using  only  versions  of  the  same  model  with  varied 
parameters  as  a  sensitivity  evaluation  on  the  model.   The 
small  amount  of  computer  time  required  to  perform  a  com- 
parison (less  than  30  seconds  on  an  IBM  360-67)  or  sensi- 
tivity analysis  is  trivial  compared  to  the  potential  increase 
in  effectiveness  of  forecasts  if  it  is  discovered  that  some 
other  model  or  combination  of  models  are  more  satisfactory 
predictors  of  the  random  processes  of  interest. 
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A    COMPARISON    n^    SHO^T    TERM    FORECAST    MODELS 

THIS    PROGRAM    GENERATES    SEVEN    FORMS    OF    TIME    SERIES    WHICH 
ARE    EACH    f=0DECAST    BY    SEVEN    FORECAST    MODELS,     ONE    OF    WHICH 
IS    THE    OPTIMAL (MINI MUM    MEAN    SQUARED    ERROR)    PREDICTOR    OF 
THE     SERIFS.       THE     SPECI " ICATION    ERROR    IS    COMPUTED    for 
EACH    FORECAST    MODEL.    FOR    EACH    TIME    SERIES    FORECAST,       THE 
AVERAGE    SPECIFICATION     ERROR    MEAN    AND    VARIANCE    ARE    THCvJ 
CALCULATED    AS    AN    OVERALL    MEASURE    OF    FORECAST    MODEL    PE^F. 

DI MENS! ON    OPSN(5  01 )  ,RNUM(500 ),OIFLSE(500 ) , CIFSEA( 500) , 
1DI FMEA< 5  00),DI-HLT( 500) ,01 -P°N(5D0) , SUMSQ< 7) , 
2HLTLVL(5  0C ) , ACOR F( 500 ) , DI FF( 500  ),DI  =  MLS( 5  00)  , 
3DIFTHL( 5  00)  ,  THLLVL (500)  jSPEO^        ( 7 ,7 ) ,C 0M3MU  (7  ) , 
4SUMER( 7),VARSPC( 7) 

LOWLUP    VAI  U-    IS    TMf'    NUMBER    OF    INITIAL    cORE<~AST    PERIODS 
WHICH    APE    DISIGNATED    C0R    FORECAST    STABILIZATION.    AFTER 


THIS    DERIOD,FCST     ERROR    CONTRIBUTES    TO    VAR. 


.ULATION. 


LOWLUP    SHOULD    *!0T    B' 
L0WLUP=100 


LESS    THA'J    3 


MAX  LIP    IS    THE    TOTAL    NUMBER    OF    OBSERVATIONS    GENERATED (OR 
READ    IN    IF    THE    PROGRAM     IS    ^DIFI:D    TO    USE    ACTUAL    DATA). 
MAXLIP    SHOULD    NOT    FXCEED    500    WITHOUT    PGM    MODIFICATION. 
MAXLUP=400 

THIS    INITIALIZES    THE    U(0,1)    GENERATOR 
X=URN(-5) 

•NUMBER'     SP~CTFIES    THE    TYPE    SERIES    TO    BE    GENERATED.     'DO 

1000*     STE-S    THROUGH    ?>LL    S~RIES-    1=    L  SE     SERIES,    2  =    MOD    LS' 

SERIES.    3=    SIMPLE    EWMA    SERIES,    4=    MOD    EWMA    SEP  IE S    (EXPO 

AUTOCORRELATEC),     5=    HOLT-WINTERS    LINEAR    GROWTH    SERIES, 

6=    THEIL-WAGE    LINEAR    GROWTH    SERIES*     7=    BPOWN    LINEAR 
r  p  n  ;.:  t  u     c;  c  o  t  •;  g 

vu     i  u  u  t      «u™Dr.  "  ~1 ,  I 

0BSN(1 )=0.0 

RNUM(l) =0.0 

GO    TO     ( 11,66,22, 33, A4, 77,55) , NUMBER 
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THIS    MODEL    GCN^RAT-S    A    SERIES    f:OP     WHICH    A    ZERO-INTERCEPT 
LINEAR    FORECAST    MODEL     IS    OPTIMAL 


•B«     IS    TH« 
11    B=3.0 


SLHPF    VALUE    TO    BE    SELECTED    AS    DISIR? 


RNS^G    IS    THE     STD    DfcV    Oc    NORMAL    RV 

RNGSIG=3.0 
THIS    DO    LOOP    GENERATES    THE    DESIRED    NUMBER    OF    RNS 

DO     100    I=1,MAXLUP 

SUM=0.0 

DO    20    .1=1,12 
THIS    IS    THE    UNI  F(  0,1)    RN    GENERATOR 

X=UP.N(2  ) 

R=  X 
20    SUM=SUM+R 

GENcRATOR    WHICH    PRODUCES     A    NORMAL  (0,X)    RANDOM    NUMBER. 

RNUM  ( I  )  =  (  SUM-6  .0)*PNGSIG 
THIS    CGMPUTES    THE    DETERMINISTIC    °ART    OF    THE    LINEAR    MODEL 
AND    ADDS    THE    NORMAL    RV    TO    IT. 

OBSN(  I  )  =  B*I+P.NUM(  I  ) 
100    CONTINUE 

GO    TO    6  0 
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THIS    MODEL     TAKCS    SAME    NORMAL    RN»S    USED    ABOVE    AND    GENERATE 

4    FIRST    0^ nER    AUTOREGRESSIVE    SERIES    FOP    WHICH    THE    MOD 
LEAST    SQUARES    c OR  EC  A ST    IS    OPTIMAL. 
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66    RH0=0.4 

EC5S  =  0.0 

DO    101     !=1,MAXLUF 
THIS    GENERATES    CORRELATED    RANDOM    SHOCKS.    SEE 

EPS=PHO*EPS+RNtJMd  ) 
THIS    GENERATES    THE    DETERMINISTIC    PART    OF 
AS    BEFORE,     BUT    NOW    THE    CORRELATED    SHOCK 

OBSM( I ) =B*!+EPS 

RNUM< I )  =  EPS 
101    CONTINUE 


THE 
IS  * 


EO.  ( A 

LINE  A 


in: 


DEO. 


-4) 

R  MODEL 


GO  TO  60 


,',.'>  O  -  -■-  . 
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THIS    MODEL    GENERATES    A    SERIES    FOR    WHICH    THE     SIMPLE    EWMA 
FORECAST    MODEL     IS    OPTIMAL. 

•RNGGAM'     IS    THE     RANGE     3F    RANDOMNESS    ASSOCIATED 
WITH    THE    SHOCK    TO    PROCESS    LFV;:L       SHOULD    BE     SET    LESS    THAN 
THE     DEGREE     ,P.NG, ASSOCIATED    WITH      THE    RANDOM  -SHOCK    ADDED. 
22    RNGGAM=5.0 

DEGREE    0=    RANDCMMPSSf RNG     »OF    UN  I F    RV    SHOULD    ALSO    BE    SET 

RNG=12. 0 

MU=0.0 
THIS    IS    A     UNIc(-X.O,X.O)     PNDM    GENERATOR.     LOOP    PROVIDES 
THE    NUMBER    OF    PNS    DESIPFD. 

DC    300     1=1 , MAXLUP 

X=UPN( 2) 

RNUMd  )=(X-C5  )*RNG 

IMA    PROCESS    IS    GENERATED    USING    THE    RANDOM    SHOCK     FORM 
OF    THE    MODEL    SHOWN    IN    E0.(B-4). 

OBSNd  )=MU  +  PNU*MI  ) 

X=URN(2) 

BIT=( X-0.5)*?NGGAM 

MU=MU+BIT 

r,;,::i;j/  t  >  —  C  M|  IM(  T  ixDT  T 
_>  u  u     oun  i    iivlic 

GO    TO    60 
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THIS    MODEL    GENERATES    A    SERIES    FOP    WHICH    COX'S    MODIFIED 
FORECAST    MODEL     IS    OPTIMAL.     COP    C0EFF=RH0*#LAG.    HERE    LAG=1 

»RH01«     IS    THE    DESIRED    CORRELATION    COEFFICIENT 
33    RH01=0.4 

CRHO=l .O-RHCl 

UNI RNG= 10.0 
THIS  IS  A  Uk'Ic(-Xt.X)  RNDMN  GENERATOR.  LOOP  GENERATES 
THE  PROPER  NUMBER  OF  RMS  DESIRED. 

DO  200  1=1 ,MAXLUP 

X=URW( 2) 

RNUMd  )=(X-0.5  )*UNIRNG 

0BSN(I+1)=3H01*0RSN(I )+CPH0*RNUM( I ) 
200  CONTINUE 

GO  TO  6  0 
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THIS  MODEL  GENERATES  A  SERIES  FOP  WHICH  THE  HOLT  TWO-PAP* 

METER  MODEL  IS  OPTIMAL.  SLOPE  EXPERIENCES  A  RNDM  CHANGE. 

THERE  IS  ^LSO  /  PNDM  MOVEMENT  AROUT  TH£i  LEVEL  OF  THE 
PROCESS  CCNTRIn:JTING  TC  THE  FORECAST  ERROR. 


THE    DEG 


r'  r. 


0C    ^aND0'vNESS    must    BE     SET    FOR    THE 


RANGE    0r    THE    FORECAST    ERRCR, 
44    RNGEPP =10. 0 

SL0PE=1  .0 

VECMU=0.0 

PCSLVL  =  0.0 

ALFA=0.2 

ALFA1=0.1 

DO    4C0    !=2,MAXLUF 

X=URM(2  ) 
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-C.5 
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ST  E 
SLVL 
U+SL 

r  i  w 

E  +  *L 

N  :W 

E    RM 

MU+S 


E    U, 

)  *  p  n 

nBS 

f  "   : 

+RNU 
OPi  + 
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-X,X>    FCST    ERROR 
G  EP  P 

EPVEO    VALUE    OF    THE    PROCESS    WITH    A 
INCLUDED. 

V  (  T  ) 

ALFA*RNUM( I ) 

UE    OF    SLOPE    DUE    TO    RNDM    PROCESS    CHMG 

LFA1*RNUM( I ) 

EL    C0R    SERIES    BASED    ON    LATEST    EST    OF 

HAMGE    FOR    THE    PER  I  OH 


x:  $  £:  *  j| 
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THIS    MODEL    GENERATES    A    SERIES    FCF     WHICH    THE    THEIL-WAGE 
FORECAST    MODEL     IS    OPTIMAL. 

THE    RANGE    OF    UNCERTAINTY    OF       FCST    ERROR    AND    SLOPE       CHMGE 
MUST    BE    SET    TO    PROPERLY    SPECIFY    THE    UNDERLYING    PROCESS. 
77    YLVL=0.G 

SL°=0.0 

EPSRNG=10.0 

DELRNG=4.0 
THE    GENERATING    MODEL    CORRESPONDS    TO    EO .    ( D- 1 ) 

DO    450    I =1  ,W*XLUF 

X=URN( 2) 

BITl=(X-0.5 )*D£LRNG 

SLP=SLP+BI TI 

YLVL=YLVL+SLD 

X=UPN(2 ) 

RI T2=( X-0.5>*EPSPNG 

OPSNd  )=YLVL+P  TT2 

PN'JM(  I)=8IT1+BIT2 
4  50    CONTINUE 


***  -nr  *r  -r-  -r  - 


,  f.  *f*  *-.  jp  *-,-.  ^f  *-f 


A.  JU  v,   J. 


J  >v  -•- 


THIS    MODEL    GENERATES    A    SERIES    FOR    WHICH    BROWN'S    QNJ 
PARAMETER    MODEL    IS    OPTIMAL 


SET. 


THE    FOLLOWING     INITIAL    CONDITIONS    AND    PARAMS    MUST    BE 
55    RNGFRR=10.0 

ALpA-^0.  1 

BS LOPE- 1.0 

BRNLVL=0.0 

FCSBPN=0.0 

BETA=1.0-ALFA 

BETASQ=BETA*36TA 

ALFASQ=ALci*iLz* 

DO    500    I=1.MAXLUP 

X=URN(2> 

RNUMd  )  =  (  X-C.5)*RNGEPR 

OBSN(  I  >  =  FCSBRN  +  RNUMU  ) 
THIS    CONTRIBUTES     \    RANDOM    CHANG"    TC    PROCESS    LEVEL 

BRNLVL=8RNLVL+3SLOPE+( 1. O-BETASO) *RNUM( I ) 
THIS    CONTRIBUTES     \    RAN  COM    CHANGE    TO    SLOPE 

8  SLn^E  =B SL  0PE+ \ L -  A  SC* R  NUM ( I ) 
THIS    LEVEL     CONTR I BUTE  S    TO    NEXT    PERIOD    OBSERVATION 

PCSBRN=RRNLVL+BSLOPE 
5  CO    CONTINUE 

O,   »'*   -*U    -i^    -J,    .1.,    »',     Jt*   "A.  «J-     Jm    "X    -J-   »•-     •'-    -V     *JU  -A-  JL>     .'.'-•,.         -J.    -J.     ,i,   J.     ..'.,,     .1.    ^    .   ,    yl-    J,   v-    J,    .1.    .  ,    ^    vl,   v»   %'.-    O,    •.'»     -J-    -'.     v'»    -.'  -   -J-    ~>  -   *-•-    -1*  •«'  '   «*-   -  *-    -* 
T"  ■*>**  1*   ***  T*  -C  *C    *<*  *i*  ^*    ■*«*   l*    *TI*  -V    T    V   *»'  T  t"i'   1*  V  1*T   T  T  T  "V*  *Y*  *  •■  -f*  <»**  1*  >(»  ^r*  *  »  -nr*  *f>   '.*  m*    -"I*  *i*   "V   *y«.  **    ^^^y***^  "T*  *".  *  "lv    "V1  •"*  •   "*t^  ■"»""  "«  ■ 

THIS      SECTION         COMPUTES    MEAN,    VARIANCE    I    AUTCCCRRELATI ON 


OF    THE    OBSERVATIONS 

OBSERVATIONS,    DH01  , 

60    NOBSN=MAXLUF 
MA XL A G= 5 

K  =  2 
3    rO    T"»     (5,4,99)  ,K 
5    K  =  3 


THE    CORRELATION    BtzTWEFN    SUCCESSIVE 

IS    US  HD    IN    SUBSEQUENT     FORECAST    MODELS 
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WRITE  (6t  2499)N!JMBFR 
2499    FORMATdH    ,/////,  10X  ,«  ST  AT  1ST  ICAL    CHARACTERISTICS    OF1, 
1«    THE    GENERATED    SERI6 S* t 5X, ■ NUMBER=    ',13,//) 
GO    TO    9 
4    K  =  l 

VP  ITE(  6,?4S8)N'J*D-FR 
2498    FORMAT (1H    ,/////. 10X, « STATI STI CAL    CHARACTERISTICS    OF1. 
1«     TH^    PNDM    SERIES1 ,5X,,NUM9ER=    ',13,//) 
XSUM=0.0 

DO    12    I=1.MAXLUP 
12    XSUM=XSUM+RNUM(I ) 

X9AR=XSUM/(NOBSN*1.0) 
VAR=0.0 

DO    21    I=1,NCBSN 
DIFF(  I  )  =  RN'JM(I  )-X3AR 
21    VAR=DIFFd  )**2+VAR 
GO    TO    2  4 
9    XSUM=0.0 

DO    10    I=1,MAXLUF 
10    XSUM=Q3SN< I )+XSUM 

X B AR=X  SU M/  ( .\]GB  SM  -1.0) 
V4R=0. 0 

DO    23    I=1,N0BSN 
DIFF( I )=OBSN(I )-XBAR 

23  VAR=DIFF( I)*DI?F(I )+VAR 

24  VAR=VAP/(NG"SN*1.0) 

C       VAR    IS    VARIANCE    0^    AUTOCOVARI ANCE    FCTN    WITH    ZrPG    LAG. 

25  DG    40    J  =  l, MAXL VG 

C       LIMLUP    IS    THi    UPPER    LPno    LIMIT    WHICH    DECREASES    WITH    LAG. 

LIMLUP=NG3SN-J 

ACVF=0.  0 

DO    30    I=1,LI*LUP 

XPROD=DIFF( IJ*DIFF(I+J) 
30    ACVF  =XP-CD+^CVF 

ACVF         =ACVF  /(NOBSNd.O) 

C       ACVF    Tc    THE    AUTOCOVAP I  A^-'CE    FUNCTION    W«TH    '  AG    .h 

C       ACORF    IS    THE    AUTOCORRELATION    FCTN,    Tmc    NORMALIZED    ACVF. 
WRITE(6,25C1)J,AC JRC( J) 
2501    F0RMAT(1H, /10X, •AUTOCORRELATION    COEFFICIENT    WITH    LAG* 
1  ,13,'=    ■  ,F7.5) 
40    CONTINUE 

RH01=AC0RF( l ) 


WRITE ( 6 ,26G0 ) XB AR , VAR, RHG1 
26C0    FORMA T( 1H,/5X, • X3AR=' ,F10. 5 , 5X , ■ VAR=« , F10 .5 ,5X , ' 


RH01=' 


C 
C 

c 
c 
c 
c 


1  ,F7.5,//) 

GG    TG    3 


C 
C 


r 
c 


j-  •**»  -A.  «(,  y„  ^>»  ■hK*  „i<  -ju  s>  ju  -J*  *V  -*'  «  *■*-  -^*  *V  -'*  V-  *^  -J*  *"*■  ->  *■'*  •*-  ***  V'  *V  -**  *  -  ■^  -1*  *  *  *^  ■»*•  *•* 
-v  *r  *v  'i*  n*  *i-  rr  nr  *c  n*  *»*  -i-  ^*  -,-.  ^  -p.  t-  ;f  -)~  ?,£.  *|*  -.-  -r  -t*  'i*  -»■»  *i*  j<*  -r  t  nr  r  t  'i*  t  t  *»* 

THIS    IS    THE    LSE    FORECAST    MODEL    (ZERO    INTERCEPT    FORM) 

THESE    IMITJ4L    CONDITIONS    SPECIFICALLY    ANTICIPATE    THE 
LSE    SERIES    GENERATED    EARLIER. 
99    A HA T= 0.0 

SUMXY=B 

SUMXSQ=1.0 

DIFLSF (  1)=0.0 

FCSLSE=3*2.C 

FCSMLS=0.0 

RH0=0.4 

DG    600    I=2,MAXLU? 
FORECAST    !FRRO^     IS    CALCULATED    AND    ACCUMULATED    FCR    LATER. 

DIFLSE(  I  )=-rSLSr:         -n^SNU) 

DIFMLS( I )=FCSMLS-OBSN(I) 
BEFORE    COEcFICT£NTS    CAN    BE    ESTI MATED , XBAR    AND    YBAR    MUST 
BE    CALCULATED. 

SUMXY  =  S!JMXY+:*03SN(  I  ) 

SUMXSQ=SUMXSO+I*I 

BHAT=SUMXY/SUMXSQ 
NOW    THE    FCST    CAN    BE     MADE, USING    THE    COEFFIC IENTS, AHATSBHAT 

FCSLSE=AHAT  +  SHAT*( 1  +  1  ) 
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c 

c 
c 
c 
c 
c 


c 
c 

r 

c 
c 

c 


THIS    IS    THE    MODIFIED    LEAST    SQUARES    FORECAST    MCDEL. 
SINCE    IT    USES    THE    SAME    DARAM£TEP     ESTIMATES    AS    THE    LSE 
MODEL    PLUS    A    CORRECTION,     ITS    FORECASTS    ARE    GENERATED    AT 
THE    SAME    TIME    AS    THE    LS'f    MODEL.     THIS    IS    THE     SAME    AS    50. 
(A-6),rXC:PT    SIGN    CHANGE    DUE    TO    DIFFERENT    PCST    ERROR    FORM 
FCSMLS  =  cCSLSi£-cHO*DIcLSE(I  ) 
600    CONTINUE 


(-    - 1       -',     JL.   -J,    o-     -'.     Ju.  ^"-    -1-    -*-   -A*   ■*•    -'  -    vU   -o    ,o    J,    „»-    O.-   -J,   <*»   -,'».'-    *.«,   -V    <JL>    —    Ok,    -A- 

v*  'r  or  or  or  i"  or  or  or  *i*  or  or  or*  or  or  or  -r  or  *p  or  or  or  op  i-  op  op  or  or  or  or 

THIS     IS    THE    SIWPLh    EWMA    FORECAST    MODEL    WITH    OPTIMAL    ALFA 

^CSl    IS    THE     INITIAL    FORECAST    NEEDED    TO    START    THE    SIMPLE 
EXD3MEMTI ALLY    WEIGHTED    MOVING    AVERAGE    FORECAST    SERIES 

FCS1=0.0 

ALFA=( 3.0*RH01-1.0) /(2.0-RH01) 

IF(PH01.L:f.C3333)     ALFA  =  0.075 

BETA=1. 0-ALFA 

DO    700    I=1,MAXLUP 

CIFSEA( I )=FCS1-0BSN(I ) 
THIS    GENERATES    FO-ECAST    FOR    NEXT    PERIOD    USING    SIMPLE    EWMA 

FCS1=ALFA*0BSN ( I )+BETA*FCSl 
7  00    CONTINUE 


c 

c 
c 
c 

«J?  ***■*  •*■  **••  iV  -V  *v.  -•*  -V  •***  -J-  »*-  -^  »*,  O-  »''  V*  -**■  «A*  "J-  J-«  Of  *^  -C-  •fc»-  %»*.  o..  *u  »u  -•-  -a*  *j*  «.«.,  „■-  u*.  %v  «•*-  »■>  -A*  -'■'  "J*  »V  V-  V*  -J-  JU  ■»«*  -X.  JU  <■**  -.',  O-  OL>  .'-  v*  -•*  -V 
or  or  or  'i*  or  ->  *r  *r  -r  *  *  op  or  or  or  or*  ***  ^*  'i*  t  f  Is  '^  t  'r  '»»  -r  or  or  or  op  "**  or  *P  **r*  or  op  or  or  or  or  or  •  r  or  -a*1  -r  or  or  t  or  or  'r  or  op  *r  o>  -t~*  or 

THIS    IS    THE    MODIFIED    EWMA    FORECAST    MODEL    PROPOSED    BY    COX 

ALFA=0.01 

BETA=1 .0-ALFA 

FCSEM/=0-0 

CO    800     I=2,MAXLUF 

c 

THIS    GFNcPATES     FORECAST     F0F    N:XT    PERIOD    USING    SIMPLE    EWMA 

c 

AND    OPTIMAL    ALFA    ASSUMING    EXPONENTIAL    AUTOCORRELATION. 

FCSEMA         =^LFA*0DSN< I- 1 )+B£TA*FCSEMA 

c 

THIS    US-S    SIMPLE    EWMA    IN    COX'S    MODIFIED    EWMA. 

FC  SMEA          ■- '"' I  0 1  *0  3SN { I -  1)  +  (  1     0—  F  HO  1  ': ::::::<"  Sc  M  * 

0 1  F  M  £  A  (  i  J  —  F  c  SM  E  A         -0  B SN { I  5 

8 CO    CONTINUE 

c 

c 
c 

c 

^r  *p  'i*  '»"  t  ix  *i"  V  -I*  'i*  'r*  '.'*  -.*  'r  *t  *ts  n*  ***  -r-  t-  t  -v  t  'p  -r  -i-  -v  -v  *i*  ir  -v  '.^    r*  '.*  v  -r  *?  >  *p  v  %*  -k*  v    .v  -t*  v*  '.-  n*  %s  *.'•  t-  v-  V  *r  -;*  -v  *i* 

THIS    IS    THE    HOLT    TWC-PARAMETE*    FORECAST    MODEL 

5iLcA  =  0.2 

BETA=1. 0-ALFA 

BSL0PE=0.0 

HLTLVLd  )=0.0 

FCSHLT=0.0 

ALFA1=0.1 

BETA1=1.0-ALFA1 

DO    900    T  =2,  MA  XL  UP 

CIFHLTC  I  )==CSHLT          -OBSN(I) 

c 

THIS    ESTIMATES    THE    CURRENT    PROCESS    LEVEL 

HLTLVL(  I  )  =  ALFA*0BSN(I  )  4-BHTA*  (MLTLVLt  1-1 )  +B  SLOPE  ) 

c 

SLOPE    IS    UPDATED    FOR    USE    ON    NEXT     ITERATION. 

B SLOPE         =ALFA1* (HLTLVLd  >-HLTLVL ( 1-1 ) )  +  BcTAl*BSLO»E 

c 

c0RECAST     IS    GENERATED    USING    CURRENT    LEVEL       AND       SLOPE    EST 

FCSHLT=HLTLVL( I )+BSL0PE 

9CC    CONTINUE 

c 
c 
c 
c 

-JU    -i-   v-       J  —'.     .>.    -■         ■      .'-    -        -i  -    ,■      -       -'       - '      ^'-    ■   .    -  i.    -i-    -J-    >■-    .(,    ,i,    .1.    OL-    -',  ^l-    J  •    J-    si,    -J*  -J  -    ,V    O-    ^',    -J.      ^C-  ~l-     -'.•    -'-    O         '^   O  -^i-    .'-    J.  O,    -X-  -«-   -•-  ->-   ••-  -«*  %•'  ~>'  >''  U» 

tttt  *i*  'i*  "i*  *i%  "»"  "**■  "*"  *r  '»"*'*  *r  or»  *ri"p  'i*  ">*  t  ^r  *>'■  or*  'i**  *r*  t*  -t*  t  i*  *i*  tt  t  t  t*  or*  *f  -r*  oi*  or-  or*  or  i*  *r  or  or  or  or*  or*  or  -o*  or  or  or  or 

THIS    IS    THE    THEIL-WAGE      FORECAST    MODEL- 

FCSTHL=0.0 

ALFA=0.5S6 

ALFA^=0.425 

BETA=1 .0-AL^A 

3  E  T A  D  =  1  .  C—  i.  L  c  A  ° 

FACALF=ALF>/(1 .0-BETA*ALFAP) 

FACBET=BETA*BETAP/( 1 .0-BETA* ALFAP ) 

THLLVL( 1)=0.0 

SLP=0.0 

DO    92  5    I  =2,MAXLUF 
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925 


DIFTHL(  T  )=FCSTHL-OBSN<  T  ) 

THLLVK  I  >=FACALF*DBSN(  I  ) +FACBET*( THLL VL  (  I-D  +  SLP) 

SLP=ALCA!J*(  THLLVK  I  )-THLLVL(I-l  ) )+BETAP*SLF 

FCSTHL=THLLVL( I )+SLP 

CONTINUE 

-     »*.     -<-    «',    ■.•-    »'«.   »'-    .'-    .L.      jL   <JU  »>-    iJb   JU   *V    »J*   -'-    ml*    .1-    -•,*>,,,    xL.    .»»   .  1^   JU   -  U,   *<-   iJU   O,   <J'   •»'  *    -.'-     ■'-    -1*    -'*     -1  -    -'  -     -  -       ^*     -*-    •&*    »'--''-•-.'.    ^'-    -■-   -^    -.'-    -*1-    -U    »'-   .J, 

**,-,-',--,■   t*  -i,  -    ■  ■   -i  -  i-  -».  -y  *p.  -y»  *^  ^  o^  <T"  'i-  *•*  f"i*  *v  t  -r  t»  *»*  *i*  'i-  *i*  •*»**.■'  -r-  -r*  *v  -r  *r  "st  *"r  'i-  i-TT'T'rT'',TTr  *v»  -**  i-  ***  - 


THIS    IS    BROWN'S    ONE-PARAMETER    FORECAST    MODEL 

ALFA=0.1 

BETA=1 .0-ALF4 

SNGLSM=0.0 

CBLSf,f]=0.0 

BSLOcF=0.0 

FCSBRN=0.0 

DIFPRN( 1  )  =  0.0 

FACTOR=ALF A/BETA 

pn    950    I=2TMAXLUP 

DIFR?*(  i  )=ECSBRN-08SN<  I  ) 
C       THIS    COMPUTES    THE    SINGLE    (1ST    ORDER) 

SNGLSM         =ALFA*0BSN< I ) +BETA*S 
C       THIS    COMPUTES    THE    2ND    ORDER    SMOG 

DBLSMO         =AL*=A*  SNGLSM         +  BETA 

SLDFE         =FACT0R*( SNGLSM         -DB 

FCSBRN  =2.0*SNGLSM         -DBL 

950    CONTINUE 
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EC  AST    ERRORS    AND    SQUARED    ERRORS 
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THIS    COMPUTES    ESTIMATED    FORECAST    ERROR    MEAN    ANT    VARIANCE 
FOR     EACH    FORECAST    MODEL. 

VARFAC=MAXLUP-L0WLUP-1 .0 

DO    4000    J=l,7 

SUMER  (J)=SUM'ER(J  )/  (  VAPFAC+1.0) 
40C0    SLMSK  J)=SUf-*SO(  J)/VARFAC 


C 

c 

r 


THIS    COMPUTES    THE    MEASUREMENT    OF    SPECIFICATION    EP^OR, 
THE    RATIO    OF    FORECAST    MODEL'S    FORECAST    ERROR    VARIANCE 
TO    THE    VARIANCE    OF    THE    OPTIMAL    MODEL    FOR    THAT    SERIES. 
PC    5000     J=l ,7 
50C0    S^ECr       (NUMBER , J) =( SUMS0( J ) /SUMSQ(NUMBEP ) )-l .0 

WRITE  (6,  5001  )(L,S?ECF       (NUMBER  .D.L.  SUM  SO  (  L)  .L.  SUMER 
1(L)  ,L  =  1,7) 
5001    FORMATC1H    ,/,5X,fSPEC    ERROR    (',12,'     ) =•  ,2 X  ,  - 10. 5 , 

1     5X,'V,>.r('  ,  12,  '     )=«  ,2X,C12.4,5X,  'AVG    ERR(  ',  12,  '     )  =  S 
22X,F10.5,/) 


THIS    PRTNTS    OUT    T^E    OBSERVATIONS    AMD    THE    FORECAST    ERRORS 
EXPEPIENCEC    BY    rACH    FORECAST    MOTEL. 
WRITE(6,2099 INU^BER 


2099    FORMATdH     ,//// ,7X , 'SUMMARY    OF    FORECAST    RESULTS    FO^ 
l'THE    GENERATED     SFRIES' , 5X, • NUMBER=    «,I3,///) 

WRIT E( 6, 2100) 
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21 


22 
10 


//,5X 
»5X, 


I 


00    F0RM\T(1H 

1  'DIFP    M    LS1"  S5X,  »DIFF 
2'DIt::F    HOLT     '  ,5X.  '01  F  = 

WRITE ( 6,2200) (OE SN  (  L ) ,DIf=LS 
1DIFMEAU  ),OIFHLT(L  )  »DI 
2MAXLUP) 
00    FORMAT < 1H     ,8(5X,  =  10.4) 
00    CONTINUE 
NUMB£R=7 
FNUM=NUMBFR*1.0 


S  OBSM' ,4X, •DIFF  L  SO. 


CS  OBSM' 
ScWMA1 ,5X, 

thill  •  ,5x, 

(L)  , 

FTHL(L) ,DI 

) 


i4X, 

MFW^A1 ,5X, 
BROWNS     /) 
DICMLS  (L)  ,CIFScA(L  ) 
FBRN(L)  «  L=LCWLUPi 


•DT^F 
•DIFF 


C 

c 
c 


THIS    COMPUTES    THE    AVERAGE    SPr.C    ERROR 
FORECAST    MODELS*  J,  OVER    UL    SERIES, I. 

DC    1100    J=l. NUMBER 

CCMSUM=0.0 

DO    1111    1=1, NUMBER 
1111    CO  MS' )  M=C  CM  SI I  M+  S  P  E  C  F  { T  ,  J  ) 

COMBMU( J)=CCMSUM/FNUM 


FOR    EACH    OF    THE 


C 

c 


THIS    CO'^PUT^S    THE    SPECIFICATION    ERROR    SAMPLE     VARIANCE 
FOR    EACH    FORECAST    MODEL, J, OVER    ALL    SERIES,!. 
ESQSUM=0.0 
DO    1112     1=1, NUMBER 
1112    ESQSUM=i£SQSUM+  (  SPECF  (j  , J)-COMBMU (J)  )**2 

V*RSDCU  >=ESQSUM/FNUM 
11  CO    CONTINUE 

THIS    SUMMARIZES    THE    SDECI FICATION    ERROR    DATA    pCP    COMPAR 
WRITF(6,3100) 

31C0    FORMAT (1-H    ,////, 20X, ■ LSL    MDL  •  ,  8X,  •  M.LSc    MDL*    7X,«EWM/ 
17X.«MEWMA    MDL1 » 6X, •HLT-WNT    MDL • ,4X, ' THL-WGE    MDL«.4X, 

DO    3111    1=1, NUMBER 
11    WRITE (6,3110)     !. (SPECF(I.J) ,J=l, NUMBER) 
10    FGPMAT(1H    ,       6X,  'SERIESP  ,12  ,'  )  ■  ,  2X  ,7  <  F9  .  5,  6X  )    //) 


31 
31 

30 


i  ;  n  T  -r 


rf       ::r:n;: 


31 


•j  i  J     r  U  K  i-i  A  T  \  1  n     ,/'//,  jluA, 

1 'COMPAR I SON' ,/) 

WRIT.'-(6,3120  >  <L.COM«MU(L  )  .VARSPC(L)  .L=l, NUMBER) 
20  FORMAT (1H  , 1  OX ,' FORECAST  MODEL  ( *  ,  12  ,  *  >  •  ,  5X,  F9  .  5,  10X, 
1F10.5) 

STOP 

END 
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